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Bell's 1964 theorem, which states that the predictions of quantum theory cannot be accounted 
for by any local theory, represents one of the most profound developments in the foundations 
of physics. In the last two decades, Bell's theorem has been a central theme of research from a 
variety of perspectives, mainly motivated by quantum information science, where the nonlocal- 
ity of quantum theory underpins many of the advantages afforded by a quantum processing of 
information. The focus of this review is to a large extent oriented by these later developments. 
We review the main concepts and tools which have been developed to describe and study the 
nonlocality of quantum theory, and which have raised this topic to the status of a full sub-field of 
quantum information science. 
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I. INTRODUCTION 

In 1964, Bell proved that the predictions of quantum 
theory are incompatible with those of any physi c al the - 
ory satisfying a natural notion of locality 1 ( Belli . Il964h . 
Bell's theorem has deeply influenced our perception and 
understanding of physics, and arguably ranks among the 
most profound scientific discoveries ever made. With the 
advent of quantum information science, a considerable 
interest has been devoted to Bell's theorem. In partic- 
ular, a wide range of concepts and technical tools have 
been developed for describing and studying the nonlocal- 
ity of quantum theory. These represent the main focus 
of the present review. Hence we will not discuss, at least 
not directly, the extensive literature dealing with the con- 
ceptual implications of Bell's theorem from a traditional 
foundational perspective. Skipping shamelessly many 
important contributions before and after Bell's ground- 
breaking discovery, the most notable on e being the fa- 
mous Einstein-Podolosky-Rosen paper (jEinstein et ali . 
Il935h . we start straightaway with the mathematical for- 
mulation of a locality constraint in the context of certain 
experiments involving separate systems, and its violation 
by the predictions of quantum theory. 



1 To avoid any misunderstanding from the start, by "locality" we 
do not mean the notion used within quantum mechanics and 
quantum field theory that operators defined in spaceliko sepa- 
rated regions commute. Bell's notion of locality is different and 
is clarified below. 
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FIG. 1 Sketch of a Bell experiment. A source (S) distributes 
two physical systems to distant observers, Alice and Bob. 
Upon receiving their systems, each observer performs a mea- 
surement on it. The measurement chosen by Alice is labeled 
x and its outcome a. Similarly, Bob chooses measurement y 
and gets outcome 6. The experiment is characterized by the 
joint probability distribution p(ab\xy) of obtaining outcomes 
a and b when Alice and Bob choose measurements x and y. 



A. Non-locality in a nutshell 

In a typical "Bell experiment" , two systems which may 
have previously interacted - for instance they may have 
been produced by a common source - are now spatially 
separated and are each measured by one of two distant 
observers, Alice and Bob (see Fig. [TJ. Alice may choose 
one out of several possible measurements to perform on 
her system and we let x denote her measurement choice, 
for instance x may refer to the position of a knob on 
her measurement apparatus. Similarly, we let y denote 
Bob's measurement choice. Once the measurements are 
performed, they yield outcomes a and b on the two sys- 
tems. The actual values assigned to the measurement 
choices x, y and outcomes a, b are purely conventional; 
they are mere macroscopic labels distinguishing the dif- 
ferent possibilities. 

From one run of the experiment to the other, the out- 
comes a and b that are obtained may vary, even when 
the same choices of measurements x and y are made. 
These outcomes are thus in general governed by a prob- 
ability distribution p(ab\xy), which can of course depend 
on the particular experiment being performed. By re- 
peating the experiment a sufficient number of times and 
collecting the observed data, one can get a fair estimate 
of such probabilities. 

When such an experiment is actually performed - say, 
by generating pairs of spin-i particles and measuring the 
spin of each particle in different directions - it will in 
general be found that 



p(ab\xy) ± p(a\x) p(b\y) , 



(1) 



implying that the outcomes on both sides are not sta- 
tistically independent from each other. Even though the 
two systems may be separated by a large distance - and 
may even be space-like separated the existence of such 
correlations is nothing mysterious. In particular, it does 
not necessarily imply some kind of direct influence of one 
system on the other, for these correlations may simply re- 
veal some dependence relation between the two systems 
which was established when they interacted in the past. 
This is at least what one would expect in a local theory. 
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Let us formalize the idea of a local theory more pre- 
cisely. The assumption of locality implies that we should 
be able to identify a set of past factors, described by 
some variables A, having a joint causal influence on both 
outcomes, and which fully account for the dependence 
between a and b. Once all such factors have been taken 
into account, the residual indctcrminacies about the out- 
comes must now be decoupled, that is, the probabilities 
for a and b should factorizc: 



p(ab\xy, A) = p(a\x, X)p(b\y, A) . 



(2) 



This factorability condition simply expresses that we 
have found an explanation according to which the prob- 
ability for a only depends on the past variables A and on 
the local measurement x, but not on the distant measure- 
ment and outcome, and analogously for the probability 
to obtain b. The variable A will not necessarily be con- 
stant for all runs of the experiment, even if the procedure 
which prepares the particles to be measured is held fixed, 
because A may involve physical quantities that are not 
fully controllable. The different values of A across the 
runs should thus be characterized by a probability dis- 
tribution q(X). Combined with the above factorability 
condition, we can thus write 



p(ab\xy)= I dXq(X)p(a\x,X)p(b\y,X) . 



(3) 



where we also implicitly assumed that the measurements 
x and y can be freely chosen in a way that is independent 
of A, i.e., that q(X\x,y) = q(X). This decomposition now 
represents a precise condition for locality in the context 
of Bell experiments 2 . Note that no assumptions of deter- 
minism or of a "classical behaviour" are being involved 
in the condition ([3]) : we assumed that a (and similarly b) 
is only probabilistically determined by the measurement 
x and the variable A, with no restrictions on the physical 
laws governing this causal relation. Locality is the crucial 
assumption behind ([3]). In relativistic terms, it is the re- 
quirement that events in one region of space-time should 
not influence events in space-like separated regions. 

It is now a straightforward mathematical theorem 3 
that the predictions of quantum theory for certain exper- 
iments involving entangled particles do not admit a de- 
composition of the form To establish this result, let 



2 Bell also used the term local causality instead of locality. Local 
hidden-variable or local realistic models are also frequently used 
to refer to the existence of a decomp osition of the form {3]l ; see 
(Goldstein et ai. 2011; Norscn, 2009) for a critical discussion of 
these terminologies. 

3 It is relatively frequent to see on the arXiv a paper claiming to 
"disproove" Bell's theorem or that a mistake in the derivation 
of Bell inequalities has been found. However, once one accepts 
the definition (|3]l , it is a quite trivial mathematical theorem that 
this definition is incompatible with certain quantum predictions. 
Such papers are thus either using (possibly unawarely) a different 
definition of locality or they are erroneous. 



us consider for simplicity an experiment where there are 
only two measurement choices per observer 1,1/6 {0, 1} 
and where the possible outcomes take also two values la- 
belled a, b G { — 1, +1}- Let (a x b y ) = J2 a b abp(ab\xy) be 
the expectation value of the product ab for given mea- 
surement choices (x, y) and consider the following expres- 
sion S = (ao6o) + (aabi) + (a±bo) — (ai&i), which is a 
function of the probabilities p(ab\xy). If these probabili- 
ties satisfy the locality decomposition (J3j , we necessarily 
have that 



S = (a b ) + (ao&i) + (ai&o) - (aih) < 2 , 



(4) 



which is known a s the Clause r -Horn e-Shimony-Holt 
(CHSH) inequality (|Clauser et aLl . ll969f ). To derive this 



inequality, we can use © in the definition of {a x b yh 
which allows us to express this expectation value as an 
average (a x b y ) = JdXq(X)(a x )\(b y )\ of a product of 
local expectations (a x }\ = J2 a a Pi a \ x i X) and (b y )\ = 
^2(,p{b\y,X) taking values in [—1,1]. Inserting this ex- 
pression in Q, we can write S = JdXq(X)S\, with 
S\ = (ao)\(bQ)\ + (a )x(bi)x + (ai)\(b )x - (ai) x (bi)\. 
Since (ao)\, (ai)\ 6 [—1, 1], this last expression is smaller 
than5 A < | <&o> A + <&i) A | + 1 <&0> A - <bi> aI - Without loss of 
generality, we can assume that (&o)a > (&i)a > which 
yields S\ = 2(b ) x < 2, and thus S = JdXq(X)Sx < 2. 

Consider now the quantum predictions for an ex- 
periment in which the two systems measured by Alice 
and Bob are two qubits in the singlet state \^f~) = 
^ ( 1 01) — 1 1 0) ) , where we have used the shortcut no- 
tation \ab) = \a) ® \b), and where |0) and |1) are con- 
ventionally the eigenstates of a z for the eigenvalues +1 
and —1 respectively. Let the measurement choices x 
and y be associated with vectors x and y correspond- 
ing to measurements of x ■ a on the first qubit and of 
y ■ a on the second qubit, where a = (<7i, <72, 03) de- 
notes the Pauli vector. According to quantum theory, 
we then have the expectations (a x b y ) = —x ■ y. Let 
the two settings x G {0, 1} correspond to measurements 
in the orthogonal directions e\ and e-i respectively and 
the settings y G {0, 1} to measurements in the direc- 
tions — (ei + e2)/v / 2 and (— e\ + e2)j\/2. We then have 
(a 6 ) = (a &i) = (ai&o) = l/\/2 and {a x b\) = -l/\/2, 
whence 



S = 2V2 > 2, 



(5) 



in contradiction with (U) and thus with the locality con- 
straint (J3j . This is the content of Bell's theorem, estab- 
lishing the non-local character of quantum theory and of 
any model reproducing its predictions. 

The CHSH inequality @ is an example of a Bell in- 
equality, a linear inequality for the probabilities p(ab\xy) 
that is necessarily verified by any model satisfying the lo- 
cality condition ([3J , but which can be violated by suitable 
measurements on a pair of quantum particles in an entan- 
gled state. The violation of these inequalities and the pre- 
dictions of quantum theory were first con vincingly con- 
firmed by the experiment of Aspect et al ( Aspect et al\ . 
Il982bl ) and in many other experiment since. 



4 



Before outlining in more detail the content of the 
present article, let us first reconsider Bell's locality con- 
dition from a more operational perspective, which illus- 
trates the spirit underlying this review. 

B. The limitations of non-communicating PhD students 

Consider a quantum apparatus which can perform a 
measurement on a quantum system in a state pa- If 
measurement x is chosen, an output a is obtained. Quan- 
tum theory predicts the statistics p(a\x) for the outcomes 
given the measurements. Suppose that a PhD student, 
who cannot realize such a quantum experiment, is in- 
stead provided with unlimited classical computational 
power and a source of random numbers. If the student 
is competent, he can simulate the same statistics as in 
the quantum experiment based only on the description of 
the state pa and of the measurement x to be performed 
on it. This is not a particularly deep remark: it is the 
daily job of physicists all over the world and an obvious 
consequence of the fact that the theory allows predicting 
the results. 

Now, consider two quantum devices in two distant lo- 
cations A and B performing measurements x and y on 
two systems in a joint state pab- Quantum theory al- 
lows computing the joint probabilities p(ab\xy), so cer- 
tainly the above student can simulate the experiment if 
he is given all the relevant information. However, in the 
quantum experiment the two locations can be sufficiently 
separated so that no information on y is available at the 
location A before a result is obtained, and similarly no 
information on x is available at B. Can two students, 
one at A and the other at B, simulate the quantum 
statistics in the same circumstances? As before, the stu- 
dents cannot manipulate any quantum systems, but they 
have unlimited computational power, access to a source 
of random numbers, and a perfect description of the joint 
state pab- Though they cannot communicate once the 
measurements are specified, they may have set-up in ad- 
vance a common strategy and have shared some common 
classical data A, which may vary across different simula- 
tion runs according to a probability distribution q(\). In 
full generality, the output of the first student will thus 
be characterized by a probability distribution p(a\x, A), 
which is fixed by their common strategy and the joint 
state Pab, but which may depend on the specific mea- 
surement x chosen and of the data A shared with the 
second student. Similarly the output of the second stu- 
dent is given by a probabilistic function p(b\y, A). The 
joint statistics simulated by the two students are thus 
characterized by the probabilities 

p(ab\xy) = q(X)P(a\x, A)p(%> A) , (6) 

A 

which is nothing but the locality condition This 
condition thus admits a very simple and operational in- 
terpretation: it characterizes the correlations that can 



be reproduced with classical resources by our two non- 
communicating students. The fact that certain exper- 
iments involving entangled quantum states violate Bell 
inequalities then imply that the two students cannot 
simulate such experiments. The violations of Bell in- 
equalities can thus be interpreted as establishing a gap 
between what non-communicating observers having ac- 
cess to classical or to quantum entangled resources can 
achieve. Note that locality, i.e., the constraint that the 
two observers cannot communicate, is the important lim- 
itation here. As we said above, if all the information 
about x and y is available to one of the students, it is al- 
ways possible to reproduce the quantum statistics using 
only classical resources. 

There is another point worth stressing here. The 
fact that entangled quantum systems are able to do 
things completely differently than classical systems is well 
known. Indeed, for more than one century physicists have 
discovered that classical physics does not explain every- 
thing. However, if given only the statistics of a real quan- 
tum experiment and of a classical simulation of it, there 
is no way to tell the difference. The brute measurement 
data produced, for instance, by a Stern-Gerlach experi- 
ment can be simulated classically; its "non-classicality" 
becomes evident only when one takes into account that a 
magnetic moment is being measured and that the mea- 
surements are associated to the direction of the gradi- 
ent of a magnetic field. In the case of Bell nonlocal- 
ity, however, the real quantum experiment and its (at- 
tempted) simulation can be distinguished solely from the 
measurement data, without having to specify which phys- 
ical degree of freedom is involved or which measurements 
are performed. This property is referred to as device- 
independence. Interpreted in this way, the violation of 
Bell inequalities can be seen as a detector of entangle- 
ment that is robust to any experimental imperfection: as 
long as a violation is observed, we have the guarantee, 
independently of any implementation details, that the 
two systems are entangled. This remark is important: 
since entanglement is at the basis of many protocols in 
quantum information, and in particular quantum crypto- 
graphic protocols, it opens the way to device- independent 
tests of their performance. 



C. Scope of this review 

We have given here only a very succinct and intuitive 
presentation of the locality condition from which Bell's 
theorem follows. This naturally raises a first series of 
questions: What are the precise physical assumptions on 
which this condition is based? Can we rigorously justify, 
in particular on relativistic grounds, the notion of locality 
captured by this condition? To what extent does nonlo- 
cality, i.e. the violation of (J3)), conflict with relativity? 
What do the various interpretations of quantum theory 
have to say about this issue? We do not address here such 
questions, that have been the subject of extensive anal- 
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ysis and discussions by both physicists and philosophers 
of science since Bell's discovery. A recent concise review 



has b een written from this perspective ([Goldstein et al 
201l[) . Bell's collection of papers on the subject ( Bell 
2004 ) is a must read, in which he explains and devel- 



ops his main result from a variety of persp ectives. In 
particular, the two articles (|Bell 1 19751 ) and ((Belli . Il990h 



introduce the principle of local causality - a precise for- 
mulation of the notion of relativistic locali ty - from whic h 
the condition ([3]) can be derived, see also ( Norsenl 120071 ). 



For a discussion of t he implications of non-locality for 
relativity we refer to (jMaudlinl . |2002[) . 



The present review has a more technical flavour: How 
can one show that the measurement statistics of a given 
experiment do not satisfy the condition (|3])? How can 
one derive Bell inequalities in a systematic way? Which 
entangled states violate these inequalities, which ones 
do not? Can quantum nonlocality be exploited for in- 
formation processing, and if yes how? How should one 
design the best experimental test of quantum nonlocal- 
ity? Etc. Though they may have foundational motiva- 
tions or implications, the works discussed here have an 
original technical component. Many of them also fol- 
low a recent trend in which non-locality is considered 
from an operational perspective and where its relations 
with other topics in quantum information science, such 
as the theory of entanglement or cryptography, are in- 
vestigated. Finally, we focus on progress reported in 
the last fifteen years or so. For works on Bell non- 
locality before this period or for aspects not covered 



here, we refer to the following reviews (IBuhrman et al. 


2010l: Clauser and ShimonvL 19781 Genovesd. 


2005 


Home and Selleril. 19911: iKhalfin and Tsirelsonl 


1992 


Mermin, 19931: Tsirelsonl Il993l Werner and Woli 


2001 


Zeilingerl 19991) and references therein. 



arc viewed as a fundamental resource. Notably, these 
ideas stimulated a series of work trying to recover the 
structure of quantum correlations (and more generally of 
quantum theory itself) from information-theoretic prin- 
ciples. Section IVII is devoted to the nonlocality of mul- 
tipartite systems. The notions of genuine multipartite 
nonlocality and monogamy of correlations are discussed, 
as well as their relevance for quantum multipartite sys- 
tems. In Section lyill we succinctly review the impressive 
experimental work that has been achieved on quantum 
nonlocality, where Bell inequality violations have been 
demonstrated using a variety of different physical sys- 
tems and experimental configurations. We also discuss 
the loopholes that may affect Bell experiments, and re- 
port recent progress made towards a loophole-free Bell 
experiment. Finally, section IVIIII deals with variations 
around Bell's theorem, in which different notions of non- 
locality - stronger or weaker than Bell's - are considered. 
Finally, the appendix provides a guide referencing Bell 
inequalities for a wide range of Bell scenarios. 



II. MATHEMATICAL CHARACTERIZATION OF 
NONLOCAL CORRELATIONS 

This section presents the main concepts and tools for 
characterizing nonlocal correlations. The notations in- 
troduced here will be used throughout this review. For 
the sake of clarity, the discussion focuses mainly on the 
case of two observers, generalizations to the multipartite 
case being usually straightforward (see also sections lILDI 
and IVII for results specific to the multipartite case). 



A. General definitions 



D. Outline 

A succinct outline of this review is as follows. Sec- 
tion |H] is devoted to setting-up some general definitions 
and presenting a mathematical characterization of nonlo- 
cal correlations. In particular, we study the general prop- 
erties of correlations that can arise between local, quan- 
tum and no-signalling systems. We address the prob- 
lem of deriving Bell inequalities from the locality con- 
dition ([3]) and of determining their maximal quantum 
violations. Section HTT1 addresses nonlocality in quantum 
theory. The main question is to understand how quan- 
tum nonlocality relates to certain properties of quantum 
resources, such as entanglement and Hilbcrt space di- 
mension. The relation between nonlocality and informa- 
tion is discussed in sections |IV] and LY] We first present 
in section IIVI various applications of quantum nonlocal- 
ity, such as communication complexity, quantum cryp- 
tography, and device-independent quantum information 
processing. Section fVl provides an information-theoretic 
perspective on nonlocality, in which nonlocal correlations 



As in the Introduction, we consider two distant ob- 
servers, Alice and Bob, performing measurements on a 
shared physical system, for instance a pair of entangled 
particles. Each observer has a choice of m different mea- 
surements to perform on his system. Each measurement 
can yield d possible outcomes. Abstractly we may de- 
scribe the situation by saying that Alice and Bob have 
access to a "black box". Each party selects locally an 
input (a measurement setting) and the box produces an 
output (a measurement outcome). We refer to this sce- 
nario as a Bell scenario. 

We label the inputs of Alice and Bob x, y S {1, . . . , m} 
and their outputs a, b € {1, . . . , d}, respectively. The la- 
bels attributed to the inputs and outputs are purely con- 
ventional, and all the results presented here are indepen- 
dent of this choice. Some parts of this review might use 
other notations for convenience. In particular when the 
outputs are binary, it is customary to write a, b E {— 1, 1} 
or a, b e {0, 1}. 

Let p(ab\xy) denote the joint probability to obtain the 
output pair (a, 6) given the input pair (x,y). A Bell 
scenario is then completely characterized by d 2 m 2 such 
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joint probabilities, one for each possible pairs of inputs 
and outputs. F ollowing the terminology introduced in 
( Tsirelsonl . il 9931 ). we will refer to the set p = {p(ab\xy)} 
of all these probabilities as a behavior. Informally, we 
will simply refer to them as the correlations character- 
izing the black box shared by Alice and Bob. A behav- 

>2 2 

ior can be viewed as a point p G M. belonging to 

i2 2 

the probability space V C K defined by the pos- 
itivity constraints p{ab\xy) > and the normalization 
constraints b=1 p(ob\xy) = 1. Due to the normaliza- 
tion constraints V is a subspace of R d " m2 of dimension 
dim-p = (a? - l)m 2 . 

The existence of a given physical model behind the 
correlations obtained in a Bell scenario translates into 
additional constraints on the behaviors p. Three main 
types of correlations can be distinguished. 



1. No-signaling correlations 

A first natural limitation on behaviors p are 
the no-signaling co nstrai nts (ICirel'so 
iPopescu and Rohrlichl Il994j ). formally expressed 



^2p(ab\xy) = ^2p(ab\xy') for all a, x, y, y 

6=1 6=1 
d d 

~^^p{ab\xy) =~^^p(ab\x' y) for all b, y, x, x' . (7) 



a=l 



a=l 



These constraints have a clear physical interpretation: 
they imply that the the local marginal probabilities of 

Alice p(a\x) = p{a\xy) = Ylb=i P{ a ^\ x v) are independent 
of Bob's measurement setting y, and thus Bob cannot 
signal to Alice by his choice of input (and the other way 
around). In particular, if Alice and Bob are space- like 
separated, the no-signaling constraints (JT]) guarantee that 
Alice and Bob cannot use their black box for instanta- 
neous signaling, preventing from a conflict with relativity. 

Let AfS denote the set of behaviors satisfying the no- 
signaling constraints ([7]). It is not difficult to see that 



ATS is an affinc subspace of M. d m 
dimJVS = 2(d- l)n + (d 



of dimension 



l) 2 m 2 =:t 



(8) 



see e.g. ()Pironid . l200i . One can thus parametrize points 
in AfS using t numbers rather than the d 2 m 2 numbers (or 
(d 2 — l)m 2 taking into account normalization) necessary 
to specify a point in the general probability space V . A 
possible paramctrization is given by the set of probabil- 
ities: {p(a\x) , p(b\y) , p(ab\xy)} where a, b = 1, . . . , d — 1 
and x,y = 1, . . . ,n. There are indeed t such probabilities 
and their knowledge is sufficient to reconstruct the full 
list of p(ab\xy) for any a, 6, x, y. Seen as a subset of M* , 
the no-signaling set is thus uniquely constrained by the 
d 2 m? positivity constraints p(ab\xy) > (which have to 
be re-expressed in term of the chosen paramctrization). 



In the case of binary outcome (d = 2), an alternative 
parametrization is provided by the 2m + m 2 correlators 
{(A x ),(B y ),(A x B y )}, where 

(A x )= a P( a \ x ) AB y )= £ bp(b\y), (9) 

ae{±i} be{±i} 



a,6G{±l} 



ab p{ab\xy) , 



(10) 



and we have assumed a, 6 £ { — 1,1}. Joint probabili- 
ties and correlators are related as follows: p(ab\xy) = 
[1 + a{A x ) + b(B y ) + ab(A x B y )} /4. Thus an arbitrary 
no-signaling behavior must satisfy 1 + a(A x ) + b ( B„) + 
ab(A x B v ) > for all a,b,x,y. See (|Bancal et all l2010h 
for a more general definition of correlators for the d > 2 
case. 

A particular subset of interest of AfS in the d = 2 
case is the one for which (A x ) = (B y ) = 0. We will 
refer to this set as the correlation space C. In term of 
the m 2 correlators (|10[) . an arbitrary point in C is only 
constrained by the inequalities — 1 < (A x B y ) < 1. Bell 
inequalities that involve only the quantities (A x B y ), like 
the CHSH inequality, are called correlation inequalities. 



2. Local correlations 

A more restrictive constraint than the no-signaling 
condition is the locality condition discussed in the Intro- 
duction. Formally, the set C of local behaviors is defined 
by the elements of V that can be written in the form 

P (ab\xy)= [ dX q(X)p(a\x, X)p(b\y, A) (11) 



where the (hidden) variables A are arbitrary variables 
taking value in a space A and distributed according to 
the probability density q(X) and where p(a\x, A) and 
p(b\y, A) are local probability response functions for Alice 
and Bob, respectively. Operationally, one can also think 
about A as shared randomness, that is some shared clas- 
sical random bits, where Alice will choose an outcome a 
depending both on her measurement setting x as well as 
A and similarly for Bob. 

Whereas any local behavior satisfies the no-signaling 
constraint, the converse does not hold. There exist no- 
signaling behaviors which do not satisfy the locality con- 
ditions. Hence the set of local correlations is strictly 
smaller than the set of no-signaling correlations, that is 
C c AfS. 

Correlations that cannot be written in the above form 
are said to be nonlocal. Note that this can happen only if 
d > 2 and m > 2 (otherwise it is always possible to build 
a local model for any behavior in V). In the following, 
we thus always assume d > 2, m > 2. 
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3. Quantum correlations 

Finally, we consider the set of behaviors achievable in 
quantum mechanics. Formally, the set Q of quantum 
behaviors corresponds to the elements of V that can be 
written as 



p(ab\xy) = tr (p AB M a \ x (g) M b \ y ) 



(12) 



where pab is a quantum state in a joint Hilbert space 
Ha <S> Hb of arbitrary dimension, M a \ x are measure- 
ment operators on Ha characterizing Alice's measure- 
ments (thus M a \ x > and Y^a=i M a \x = l)i an d simi- 
larly M b \ y are operators on Hb characterizing Bob's mea- 
surements. 

Note that without loss of generality, we can always as- 
sume the state to be pure and the measurement operators 
to be orthogonal projectors, if necessary by increasing the 
dimension of the Hilbert space. That is, we can equiva- 
lcntly write a quantum behavior as 



p{ab\xy) = (ip\M a \ x <g> M b \ y \ 



(13) 



where M a \ x M a ,\ x 



Saa'M alx , J2a M a\x = and simi- 
larly for the operators M b \ y . 

A different definition of quantum behaviors is also pos- 
sible, where instead of imposing a tensor product struc- 
ture between Alice and Bob syste ms, we merely re quire 
that their local operators commute ( Tsirelsonl . 1 1993T ) . We 
call the corresponding set Q' , i.e., a behavior p belongs 
to Q! if 



p(ab\xy) = {ip\M a \ x M b \ y \ 



(14) 



where \ip) is a state in a Hilbert space H, and M a i x and 
M b \ y are orthogonal projectors on H defining proper mea- 
surements and satisfying [MaU, M b i y ] = 0. The former 



definition (|13[) is standard in non-relativistic quantum 
theory, while the second one (|14l) is natural in rclativistic 
quantum field theory. Since [M a \ x <E> 1b, A a <8> M b i y ] = 
it is immediate that Q C Q' . It is an open ques- 
tion, known as Tsirelson's p roblem, whet h er the inclu- 
sion is strict, i.e., Q ^ Q' (iFritz . 12012a! : Jjunge etal\ . 
l2010al : IScholz and Wernerl 120081 : iTsirelsoil Il993h . In the 
case where the Hilbert spaces H, Ha, Hb are finite, it is 
known that the definitions (ITBl and (TBI) coincide (see e.g . 
(iDohertv et all l2008t iNavascues et all l201lt iTsirelsonl 
1 19931 )). It is also known that Q = Q' if Alice has a binary 
choice of inputs with two outputs each, independently of 
Bob' s number of inputs and outputs ( Navascues et all 
1 2 1 lh . (Or more precisely in this case any element of 
Q! can be approximated arbitrarily well by an element 
of Q). For many applications and results, it does not 
matter whether we consider the quantum sets Q or Q' . 
In the following, we will drop the distinction and use the 
notation Q to refer to both sets, except when results are 
specific to only one definition. 

Clearly, any local behavior admits a description of 
the form (TT2"]) and thus belongs to Q. Moreover, any 



Bell inequality 




FIG. 2 Sketch of the no-signaling (MS), quantum (Q), and lo- 
cal (£.) sets. Notice the strict inclusions £c2C ATS. More- 
over, TV'S and C are polytopes, i.e., they can be defined as 
the convex combination of a finite number of extremal points. 
The set Q is convex, but not a polytope. The hyperplanes 
delimiting the set C correspond to Bell inequalities. 



quantum behavior satisfies the no-signaling constraints. 
However, there are quantum correlations that do not 
belong to the local set (this follow from the violation 
of Bell inequalities) and, as we will see, there are no- 
sign aling correlations that do not belong to the quantum 
set dKhalfin and Tsirelsonl 1 19851 : iPopescu and Rohrlichl . 
11994 lRastalll . ll985t) . In general, we thus have the strict 
inclusions £cQC NS (see Fig. [5]). Furthermo re, it can 
be sh own that dim£ = dim Q = dimA/\S = t ( Pironiol . 
l2005h where t is defined in ©. 

In the following sections, we discuss the properties of 
£, Q, and AfS in more details. In particular, we will see 
how it is possible to decide if a given behavior belongs or 
not to one of these sets. We will show how each set can 
be characterized in terms of Bell-type inequalities and 
discuss how to compute bounds for Bell-type expression 
for behaviors in C, Q, and MS. 



B. Bell inequalities 

The sets £, Q, and MS are closed, bounded, and con- 
vex. That is, if pi and P2 belong to one of these sets, then 
the mixture /ipi + (1 — /i)p2 with < p, < 1 also belongs 
to this set. The convexity of Q can be established fo r 
instance by following the argument in ( Pitowskvl . Il986t) . 
By the hyperplane separation theorem, it follows that for 
each behavior pgi* that does not belong to one of the 
sets JC = £, Q, or MS there exists a hyperplane that sep- 
arates this p from the corresponding set (see Figure [2]) . 
That is if p ^ JC, then there exists an inequality of the 
form 



s p 



E 

abxy 



sf v p(ab\xy) < S k 



(15) 
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that is satisfied by all p S K, but which is violated by p: 
s-p > 3}.. In the case of the local set £, such inequalities 
are simply Bell inequalities. Thus any nonlocal behavior 
violates a Bell inequality. An example of such an inequal- 
ity is the CHSH inequality ((4]) that we have introduced in 
section II. Al The inequalities associated to the quantum 
set, which characterize the limits of Q, are often called 
quantum Bell inequalities or Tsirelson inequalities. 

In the following, we will refer to an arbitrary s g R f as 
a Bell expression and to the value Si such that s • p < Si 
holds for all p € C as the local bound of this Bell expres- 
sion. Similarly, wc define the quantum bound S q and the 
no-signaling bound S ns as the analogue quantities for the 
sets Q and ATS. If S q > Si we also say that quantum 
mechanics violates the Bell inequality s • p < Si. When 
such a behavior is observed one speaks of a Bell inequality 
violation. 



iment. Since the total number of inputs and outputs is 
finite, there can only be a finite number of such assign- 
ments, and hence a finite number of hidden variables. 

More precisely, we can rephrase the local model (fTTj) 
as follows. Let A = (oi, . . . , a m ; bo, ... , b m ) define an 
assignment of outputs a x and b y for each of the inputs 
x = 1, ... ,771 and y — 1, . . . , m. Let d x € C denote the 
corresponding deterministic behavior: 



d x (ab\xy) = 



1 if a = a x and b 
otherwise. 



(17) 



There are d 2m possible output assignments and therefore 
d 2m such local deterministic behaviors. A behavior p is 
local if and only if it can be written as a convex combi- 
nation of these deterministic points, that is if 



p = ^2 qxdX > with qx - °> X] qx = 1 



(18) 



1. The local polytope 

Let us now investigate how Bell inequalities, i.e., the 
hypcrplancs characterizing the set C, can be found. To 
this end, it is useful to note that we can express local 
correlations in a simpler form. A first step is to real- 
ize that local correlations can, equivalently to the def- 
inition (jlip . be defined in terms of deterministic local 
hidden-variable models. In a deterministic model, the 
local response functions p(a\x, A) and p(b\y, A) only take 
the values or 1, that is, the hidden variable A fully spec- 
ifies the outcome that is obtained for each measurement. 
No such requirement is imposed on the general stochastic 
model (|lip . That both definitions are equivalent follows 
from the fact that any local randomness present in the 
response function p{a\x,\) and p(b\y, A) can always be 
incorporated in the shared random variable A. To see 
this, introduce two parameters a*iA*2 € [0, 1] in order to 
define a new hidden- variable A' = (A^i,^)- Let 



p'(a\x,X') 



1 if F(a- l\x,X) < Ati < F(a\x,X) 
otherwise, 

(16) 

where F(a\x, A) = J2a<a P( a \ x > ty> De a new response 
function for Alice and define a similar one for Bob. If we 
choose g'(A') = q'(X, a«i, \x-x) = q(X) for the new hidden 
variable distribution, that is if we uniformly randomize 
over a*i and /i2, we clearly recover the predictions of the 
general, stochastic model (fTTj) . The newly defined model, 
however, is deterministic. Thi s equ i valenc e between the 
two models was first noted in (|Find . 11982ft . 

We can further simplify the definition by noting that 
we only need to consider a finite number of hidden vari- 
ables. Indeed, in a deterministic model, each hidden 
variable A defines an assignment of one of the possible 
outputs to each input. The model as a whole is a proba- 
bilistic mixture of these deterministic assignments of out- 
puts to inputs, with the hidden variable specifying which 
particular assignment is chosen in each run of the exper- 



This last representation is particularly useful as it pro- 
vides an algo rithm for determining if a given behav- 
ior p is local (jKaszlikowski et al I l2000t IZukowski et all . 
Il999l ). Indeed, determining whether there exist weights 
q\ satisfying the linear constraints in Eq. (fT8|) is 
a t ypical instance of a linear programming prob- 
lem ( Bovd and Vandenberghei . |2004 ) for which there ex- 
ist algorithms that run in time that is polynomial in the 
number of variables. Note, however, that since there are 
d 2m possible A the size of this particular linear program is 
extremely large and hence the algorithm is not efficient 
by itself. Every linear program comes in a primal and 
dual form. The dual form of the linear program asso- 
ciated to (|18|) has an interesting physical interpretation. 
Indeed, it can be formulated as 



ma x (s , S!) s ■ p 
s.t. sd A 
s- p- 



Si 

-Si<0, 
Si<l. 



A = 1, 



j2m 



(19) 



If p is local, the maximum S of the above program is 
S < 0. If p is non-local, the maximum is S = 1, i.e., 
the program returns an inequality s • d x < Si satisfied by 
all deterministic points (and hence, by convexity, by all 
local points) , but violated by p: s • p = Si + 1 > Si . That 
is the program (|19p provides a procedure for finding, for 
any p, a Bell inequality that detects its non- locality. 

Since the set C is the convex hull of a finite number of 
points, it is a polytope. The local deterministic behaviors 
d A correspond to the vertices, or extreme points, of the 
polytope. It is a basic result in polyhedral theory, known 
as Minkowski's theorem, that a polytope can, equiva- 
lently to the representation (jT5J) as the convex hull of 
its vertices, be represented as the intersection of finitely 
many half-spaces. Hence, we have that p S C if and only 
if 



P<Sl Vie/, 



(20) 



where / indexes a finite set of linear inequalities. If on 
the other hand p is nonlocal, it necessarily violates one 
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of the inequalities in ((20)) . Thus the local set C can be 
characterized by a finite set of Bell inequalities. 



2. Facet Bell inequalities 

If s • p < Si is a valid inequality for the polytope £, 
then F = {p e £ | s ■ p = Si} is called a face of C. 
Faces of dimension dim F = dim C — 1 = t — 1 are called 
facets of C and the corresponding inequalities are called 
facet Bell inequalities. The terminology "tight Bell in- 
equalities" is also used 4 . Facet inequalities are impor- 
tant because they provide a minimal representation of 
the set C in the form (|2"0")) : minimal as they are necessar- 
ily required in the description (|20[) . and since any other 
Bell inequality can be written as a non-negative combi- 
nation of the facet inequalities. These notions are eas- 
ily understood and visualised in two or three dimensions 
(note, however, that our low-dimensional intuition is of- 
ten unreliable in higher dimensions). A more general dis- 
cussion of polytope theory ( not applied to Bell inequal- 
ities) can be found in, e.g., (ISchriiverl Il989h or ()Zieglerl . 
I1995D . The connection between optimal Bell inequalities 
and polytope theo r y was realiz e d by different authors 
dGarg and Merminl. Il984 iPerest Il999t iPitowskvl 11989 : 



19811) 



Werner and Wo lf. 2002), and most notably by (jFroissard , 



While facet Bell inequalities provide a practical de- 
scription of the local polytope C, this polytope is, how- 
ever, defined in term of its vertices, the local determin- 
istic behaviors d A . The task of determining the facets of 
a polytope, given its vertices, is known as the facet enu- 
meration or convex hull problem. For sufficiently simple 
cases, it is possible to obtain all t he facets with the help 
of computer codes, such as edd ( Fukudal . 0) or porta 
( Christof and Lobell . D), 

which are specifically designed 
for convex hull computations. However, such programs 
become rapidly prohibitively time consuming as the num- 
ber of parties, inputs or outputs grow. Note also that the 
simpler problem of determining whether a behavior is lo- 
cal using the linear program associated to (TT9")) becomes 
also rapidly impractical for large number of inputs m 
since the number of deterministic points scale exponen- 
tially with m. Results in computer scienc e tell us that 
this p roblem is in general extremely hard ( Babai et all 
199lh . More precisely, it has been shown that deciding 
whether a behavior is local for the class of Bell scenarios 
with binary outpu ts (d — 2) and m inputs is NP-complctc 
(lAvis et all l2004h . It is therefore highly unlikely that 
the problem of characterizing the local polytope admit a 
simple solution in full-generality 

In the following, we list some facet Bell inequalities 
of interest. Note that the positivity conditions (corre- 



4 Note however that in polytope theory a tight inequality refers 
merely to an inequality that "touches" the polytope, i.e., such 
that F^8. 



sponding to p{ab\xy) > 0) are always facets of the local 
polytope, but obviously are never violated by any phys- 
ical theory. All other facet inequalities are violated by 
some no-signaling behaviors, and possibly by some quan- 
tum behaviors. It is in fact an open question whether 
there exist facet inequalities in the bipartite case (dif- 
ferent than the positivity ones) that are not violated by 
any quantum behaviors (such i nequalities are known in 
Bell scenarios with more parties (|Almeida et aLl . [2010a|) V 
Note also that if an inequality defines a facet of the lo- 
cal polytope then it is obviously also the case for all the 
inequalities obtained from it by relabeling the outputs, 
inputs, or parties. What we mean thus in the follow- 
ing by "an inequality" is the whole class of inequalities 
obtained by su ch operations. Finally, it was shown in 
(iPironiol [2005h ;hat there exists a hierarchical structure 
in the facial structure of local polytopes, in the sense that 
a facet Bell inequality of a given polytope with d outputs 
and m inputs can always be extended (or lifted) to any 
polytope with d' > d and/or m' > m (and also to poly- 
topes corresponding to more parties) in such a way as to 
define a facet of the new polytope. 



3. Examples 

The simplest non-trivial Bell scenario corresponds to 
the case d = 2, m = 2. The corr esponding l ocal p olytope 
was completely characterize d in ( Froissard Il98ll ) and in- 
dependently in (iFind . |l982) . In this case, there is only 
one (non-trivial) facet inequality : the CHSH i nequality 
introduced in (j4]). It is shown in ( Pironiol . I2004T) that the 
CHSH inequality is also the only facet inequality for all 
polytopes with 2 inputs and 2 outputs for Alice and an 
arbitrary number of inputs and outputs for Bob. 

The case d = 2, m = 3 was computationally solved 
in ( Froissardl . [l98l|) who found that, together with the 
CHSH inequality, the following inequality 



Pi 



-Pi -PU—P12— Pl3 

-P21 - P31 - P22 + P23 + P32 > — 1 



(21) 



is facet defining, where p% = p{a = l|x), Py = p(b = l|y), 
and p xy = p(a = l,b = l\xy). This result w a s late r 
on redcrived in (jCollins and Gisinl . |2004 ISliwal . 120031) . 
The Froissard inequality is also refere ed as the J3322 in- 
equal ity, following the terminology of ( Collins and Gisinl . 
120041) . " Note that this inequality could be generalized 
for the case of an arbitrary number of measurements m 
with binar y outcomes, a fam i ly kn own as the I mm 22 in- 
eo ualities (ICollins and Gisinl . |2004| ). proven to be facets 
in (|Avis and ltd |2007| ). 

For d arbitrary and m = 2, (| Collins et all l2002al ) in- 
troduced the follo wing inequality (we use the notation of 
(lAcm et q*J . l2005l) 



[ai-6i] + [6i-02]-r-[o2-6 2 ] + [b2-oi-l] > d-1 (22) 



where [a x - b y ] = J2j=o JP( a 
similarly for the other terms. 



b = j mod d\xy) and 
Note that for conve- 
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nience the measurement outcomes are now denoted a, b € 
{0, 1, d — 1}. This inequality is known as the CGLMP 
inequality. For d = 2, it reduces to the CHSH inequal- 
i ty. It has been shown to be facet-defining for all d in 
(|Masaned . l2003l) . 



The above inequality can be extended t o an arbitrary 
numb er of inputs m in the following way ( Barrett et all 
I2007T) 



-b\] + [bi - a 2 ] 
. . + [a m — b m ] - 



- [«2 
[b m 



b 2 ] 

■ ai - 



l]>d-l (23) 



Though this Bell inequality is not a facet inequality, 
it is useful in several contexts. In the case d = 
2, it reduces to the chained inequality i ntroduced in 



(|Braunstein and Caved . Il990t iPearld . Il970l) . 

Beyond these simple cases, a huge zoology of Bell in- 
equalities has been derived and it would be impossible 
to discuss them all here in detail, in particular given the 
increase of complexity with larger values of d and m. 
For instance, in the case d = 2, there is only one (non- 
trivial) facet Bell inequality for m = 2, two inequali- 
ties for m = 3, but already for m = 4 their number 
is not known ( Brunner and Gishl . 120081) . F or m = 10 , 
there are at least 44, 368, 793 inequalities (jAvis et all 
120041) (and this fi gure is probably a gross underestimate)! 
To complete the simple examples given above, let us 
mention some recent papers where new Bell in e quali- 
ties have been d e rived. In ( Brunner and Gisir] . l2008t 
ICollins and Gisinl . l2004h . several facet Bell inequalities 
have been obtained by solving numerically the convex 
hull problem for small va l ues of d and m. In ( Avis et all 



null problem 1 

120041 . 120051 : lAvis and ltd . 120071) large families of Bell in- 
equalities are obtained by establishing a relation between 
the local polytope for d = 2 and a high-dimensional 
convex polytope c alled the cut po l ytope in polyhedral 
combinatorics. I n dPal and Vertesil . Hooa IVertesil 120081: 
IVertesi and Pal . I2008D new algorithms have been pro- 
posed to construct families of facet and non-facet Bell 
inequalities in the d — 2 case. Methods exploiting sym- 
metries to generate Bell inequalities for arbitrary d and 
m (and an arbitrary number n of p arties) are investi- 
gated in (jBancal et all l2012al . [2010h . Finally, while we 
focused here on the case where d and m are finite, it 
is also possible to define Bell inequali ties taking a con 
tinuous set of values for the outputs (ICavalcanti et al 



20071: ISalles et all [2oToh or the input s i|Aharon etal. 



20121 : iKaszlikowski and Zukowskil I1999D . 



4. Nonlocal games 

Bell inequalities are also referred to as nonlocal games 
or sometimes simply games. Looking at Bell inequalities 
through the lens of games often provides an intuitive un- 
derstanding of their meaning. Such games enjoy a long 
history in computer science where they are known as in- 
teractive proof systems (see ( Condor] , Il989l ) for an early 



the quantum setting, under the name of intera ctive proof 
systems with entanglement ( Cleve et all [20041 ) . In order 
to make such literature accessible, let us see how the two 
concepts can be translated into each other 5 . 

When talking about a game, we imagine that there is 
an outside party, the referee that plays the game against 
Alice and Bob. In this context, parties or systems are 
referred to as players. Papers dealing with interactive 
proof systems also refer to the referee as the verifier, and 
to the players as provers. The referee chooses a question 
x 6 X for Alice and y 6 Y for Bob according to some 
probability distribution it : X x Y — >• [0,1] from some set 
of possible questions X and Y. Upon receiving x from 
the referee, Alice (Bob) return an answer a € Ra {b € 
Rb) from some set of possible answers Ra (Rb)- The 
referee then decides whether these answers are winning 
answers for the questions he posed according to the rules 
of the game. These rules are typically expressed by means 
of a predicate V : X x Y x R A x R B —> {0, 1} where 
V(a, 6, x,y) = 1 if and only if Alice and Bob win against 
the referee by giving answers a, b for questions x and y. 
To emphasize the idea that the correct answers depend 
on the questions given, one often writes the predicate as 
V(a,b\x,y). 

Alice and Bob arc fully aware of the rules, that is, they 
know the predicate V and the distribution it. Before the 
game starts, they can agree on any strategy that may 
help them thwart the referee. However, once the game 
started they can no longer communicate. In particular, 
this means that Alice does not know which question is 
given to Bob and vice versa. In the classical setting, 
such a strategy consists of shared randomness, which is 
the computer science name for local hidden variables. In 
the quantum case, Alice and Bob's strategy consists of a 
choice of shared quantum state and measurements. 

The relation between games and Bell inequalities be- 
comes apparent by noting that the questions are simply 
labels for measurement settings. That is, using our ear- 
lier notation we can take X = Y = {1, . . . , m}. Note that 
we can without loss of generality assume that the num- 
ber of settings \X\ and \Y\ are the same - otherwise, we 
can simply extend the number of settings for each party 
but never employ them. Similarly, the answers corre- 
spond to measurement outcomes. That is, we can take 
R A = R B = {l,...,d}. 

Any strategy leads to some particular probabilities 
p(a, b\x, y) that Alice and Bob give answers a, b for ques- 
tions x,y respectively. In the language of Bell inequal- 
ities, this is simply the probability that Alice and Bob 
obtain measurement outcomes a and b when performing 
the measurements labelled x and y. The probability that 
Alice and Bob win against the referee for some particular 



survey). More recently, they have also been studied in 



For the purpose of illustration, we will thereby restrict ourselves 
to the case of only two parties, Alice and Bob. However, the 
relation holds for an arbitrary amount of parties. 
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strategy can thus be written as 

x,y a,b 

In the classical or quantum setting, one can consider the 
maximum winning probability that Alice and Bob can 
achieve. For instance, considering classical resource, we 
have that 



maxp. 



Si 



(25) 



where the maximization is taken over all deterministic 
strategies of Alice and Bob. Note that this leads to the 
familiar form of a Bell inequality 



P<Si 



where the coefficients are given by 



n(x,y)V(a,b\x,y) . 



(26) 



(27) 



Note that these coefficients are always positive. Hence 
games form a subset of general Bell inequalities. It is 
however possible to transform a Bell inequality into one 
with only positive coefficients by adding a multiple of 
the identity, possibly followed by renormalization by an 
appropriate factor. In complexity theory, the winning 
probability is also often referred to as the value of the 
game. 



a. XOR games. A class of games t hat is very well un - 
dcrstood are so-called XOR-gamcs ( Cleve et M I2004D . 
In an XOR game, each player only has two possible 
answers a,b £ {0,1}. To decide whether Alice and 
Bob win, the referee computes the XOR c = a © b := 
a + b mod 2 and then bases his decision solely on c. 
For such games the predicate is generally written as 
V(c\x,y) := ^ a V{a,b = effi a\x,y). We will see later 
that it is easy to find the optimal quantum strategy for 
XOR games, and indeed the structure of their optimal 
measurements is entirely understood. Also, multi-player 
XOR games are reasonably well-understood and bounds 
relating the classical and quantum winning probabilities 
are known (|Briet and Vidickl . |2012[) . 

Let us simply note for the moment that XOR games 
are equivalent to correlation Bell inequalities with binary 
outcomes. Indeed, from Eqs. (JTUJ) it follows that we can 
write p(a © b = 0\x, y) = ± (1 + (A X B V )) and p(a © b = 
l|x, y) = I (1 — (A x B y )). The winning probability for an 
XOR game can thus be written as 



Pv 



-^7r(x,y) 



(28) 



.r.y 



V(c\x,y)(l + (-iy(A x B y )) , 

ce{o,i} 

which is the general form of a correlation Bell inequal- 
ities. XOR games can thus be recast as correlation in- 
equalities and vice versa. 



b. An example: CHSH as a game. An illustrative example 
of how correlation Bell inequalities transform into games 
and vice versa is provided by the CHSH inequality. For 
convenience, we will here take X = Y = {0, 1} (instead 
of {1, 2}), as well as R A = R B = {0, 1}. Viewing CHSH 
as a game, the rules state that Alice and Bob win if and 
only if x ■ y = a © b. Plugging this into (|2"5)) one obtains 



^chsh 

^win 



l 



(29) 



where 5* is the CHSH expression as given in (j4j). Indeed 
one has that 5* < 2 for any classical strategy. Hence, the 
probability for Alice and Bob to win the game using clas- 
sical resources is at most 3/4. Using quantum resources, 
the winning probability is at most (1 + 1/^/2) /2 sa 0.85, 
as given by Tsirelson's bound S < 2^2. 



c. Projection and unique games. A projection game is a 
game in which for every pair of questions x and y to Alice 
and Bob, and for every answer a of Alice, there exists a 
unique winning answer for Bob. In the quantum informa- 
tion literature, these are also often simply called unique 
games. However, in the classical computer science litera- 
ture and also some of the quantum information literature 
the term unique game may also refer to a game for which 
for any pair of questions (x, y) there exists a permuta- 
tion TT XtV G {l,..,,d} over the set of possible answers 
such that Alice and Bob win if and only if their answers 
obey a = it x , v (b). In terms of the predicate this means 
that V(a, b\x,y) = 1 if and only if b = 7r XiJ ,(a). Note that 
in this language, every unique game is a projection game 
because there is only one correct answer for Bob for each 
x,y, and a. However, not every projection game forms a 
unique game. 

A more general notion which imposes a limit 
on th e number o f win ning answers are k-to-k' 
games (|Kempe et all l2010h . More precisely, a game is 
k-to-k' if for all questions x and y the following two con- 
ditions hold: for all answers a of Alice there exist at most 
k winning answers for Bob, and for all answers b of Bob 
there exist at most k' winning answers for Alice. A pro- 
jection game is thus a k-to-k' game for k = k' = 1. 



d. Other special classes of games. Several other special 
classes of games have been studied on occasion. A lin- 
ear game is a game for which one can associate the set 
of possible answers {1, . . . , d} with an Abclian group 
H of size d and find a function W : {l,...,m} x2 — » 
H such that V(a, b\x, y) = 1 if and only if a — b = 
W(x,y). Any linear game is a unique game, and has 
been shown to have the special property to be a uni- 
form game, that is, a game in which there exists an 
optimal quantum strategy such that the marginal dis- 
tributions p(a\x) and p(b\y) are t he uniform d i stribu - 
tions over Ra and Rb respectively ( Kempe et aZ.I . [2O10h . 
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Furthermore, a game may be called free if the ques- 
tions are drawn from a product distribution, that is, 
ir(x, y) = tta(x) x irniy) for so me distributions tta and 
ttb (jKempe and Vidickl . l2011al) . A game is called sym- 
metric if for all questions x,y and all answers a,b, we have 
V(a,b\x,y) = V( b a\y,x ) (jDinur and Reingoldl . 120061: 
iKempe and Vidickl . l2011al) . An example of a game that 
is both free and symmetric is given by the CHSH game 
above. Another class of games that has drawn attention 
in the computer science literature is characterized merely 
by the fact that there exist a quantum strategy that wins 
the game with probability p w - m = 1. Such games are 
sometimes also called Kochen-Specker games (or pseudo- 
telepathy games or Greenberger-Horne-Zeilinger games) 
due to the fact that the optimal quantum strategy yield s 
a so-called Kochen-Specker set ( Renner and Woli . 12004) . 
a concept in conte xtuality which i s outs ide the scope 
of this survey (see ( Brassard et all |2005|) for a survey 
on such games, see also the related discussion in sec- 
tion uteJ). 



C. Bell inequality violations 

In the above discussion, we saw that it is in princi- 
ple possible to decide (albeit very inefficiently) whether 
a given behavior is local and to compute the local bound 
Si of an arbitrary Bell expression. In this section, we 
look at the analogous problem in the quantum and no- 
signaling case. We review the existing methods for com- 
puting the quantum and no-signaling bounds, i.e., the 
maximal quantum and no-signaling violations, of an ar- 
bitrary Bell expression s. Such methods can also be used 
to determine if a given behavior admits a quantum or 
no-signaling representation and thus this section is more 
generally concerned with the problem of practical charac- 
terizations of the quantum and no-signaling sets beyond 
the formal definitions (jT2"j) and (J7]). 



1. Quantum bounds 

a. Properties of quantum correlations. Before discussing 
in more detail how one can compute the quantum bound 
S q of a Bell expression, let us briefly discuss the general 
structure of the quantum set Q. Recall that a behavior 
p is quantum if, as defined in Eq. (|14[) it can be writ- 
ten as p(ab\xy) = {ip\M a \ x M b \ y \ip) where is a state 
in a Hilbert space H, and M a \ x and M b \ y are orthogonal 
projectors on H defining proper measurements and satis- 
fying [M a | x , M^ y ] = 0. (For characterizing the quantum 
set it is convenient to assume we impose commutation 
relations rather than a tensor product structure and we 
will follow this approach in the remainder of this section.) 

As we already mentioned, the local set C is strictly 
contained in the quantum set Q, i.e., there are quan- 
tum behaviors that are nonlocal, and thus in general 
S q > Si. There are two basic requirements that any 



quantum behavior must satisfy to be nonlocal. First 
Alice's different measu rements mu st be non-commuting 
as well as Bob's ones (IFind . Il982h . Second the state p 
must be entangled. Without surprise, quantum nonlocal- 
ity can thus be traced back to the two features usually 
seen as distinguishing quantum from classical physics: 
non-commutativity and entanglement. 

Contrarily to the local set, the set Q of quantum cor- 
relations is generally not a polytope. It can therefore 
not be described by a finite number of extreme points or 
a finite number of linear inequalities. It is not difficult 
to see though that all extremal points of C, i.e the local 
deterministic behaviors, are also extremal points of Q. 
Furthermore certain faces of C are also faces of Q. An 
example is provided by the d — 1-dimcnsional face asso- 
ciated to the hyperplanes p{ab\xy) = (note, however, 
that the corresponding Bell inequalities, p(ab\xy) > 0, 
can not be violated by any physical behavior). Thus 
while Q is not a polytope, its bou ndary contains some 
flat regions. In ( Linden et it is shown that the 

local and quantum set have common faces which corre- 
spond to Bell inequalities that can be violated by certain 
no-signaling behaviors. As we mentioned earlier, it is 
an open question whether there exist such examples of 
maximal dimension, i.e. whether there exist facets of C 
corresponding to Bell inequalities that are not violated 
by Q but which can be violated by ATS (such exampl es 
are known in the tripartite case ( Almeida et aUfeoiOal) ). 



The boundary of the nonlocal part of Q may also con- 
tain flat regions, i.e., the maximal violation of a Bell 
inequality may sometimes be realized with two or more 
different non-local quantum behaviors. The question of 
when an extremal quantum behavior can be realized by 
a unique quantum representat ion (up to unitary equiva- 
lence) has been considered in ( Franz et all 1 2 llh . where 
it has in particular been shown that in the correlation 
space C all non-local extremal behaviors are uniquely 
realizable in the cases m = 2 and m = 3. Exam- 
ples of non-correlation Bell inequalities that are maxi- 
mally vio lated by a unique quantum behaviors have been 
given m (|Acm et all 1201 2l ). These inequalities are max- 
imally violated by partially entangled states, thus show- 
ing that these state are necessary to cha racterize the 
boundary of the quantum region , see also ( Liang et all 
1201 li IVidick and Wehnerl . 12011) Note, however, that 
the s o-called embezzling state ( van Dam and Havdenl . 
l2003f ) is universal in the sense that any two-party Bell 
inequality can be maximally violate d using an embez- 
zling state ( de Oliveira Oliveirall2010[ ) up to a small error 
term. 

Let us now focus more specifically on the problem of 
computing the quantum bound of a Bell expression. Re- 
call that Q as any convex compact set can be described 
by an infinite system of linear inequalities of the form 
s • p < S q , here the quantum Bell inequalities. Given an 
arbitrary Bell expression s, its corresponding quantum 
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bound is given by 



max s 
pes 



max 1151 
s 



where 



5 = E 

abxy 



s ab M i 

a xy lyl a\Q 



b\y 



(30) 



(31) 



is the Bell operator associated to s, 1 |<S| | denotes the spec- 
tral norm (largest eigenvalue) of S, and the above opti- 
misation is performed over all possible Bell operators iS 
associated to s. That is, over all possible measurements 
{M a \ x }a and {M b \ x }b where the coefficients s X y are given 
by the choice of s. In the case of the CHSH expression, 
the Bell operator takes the form S = A\(Bi + B^) + 
A 2 (Bi — B 2 ), where A x B v are arbitr ary ±l-eigenvalued 
observables. Following ( Landaul . ll987l ). we can der i ve the 
quantum bound of the CHSH inequality ( Cirel'sorl Il980l ) 
by noting that S 2 — 4 + [Ax, A 2 ][B 1 , B 2 ], from which it 
follows that ||5 2 || < 8 and hence ||5|| < 2^/2. Computing 
the bound of other Bell inequalities is a more complicated 
business. It has been shown to be an NP -hard problem 
in the tripartite case ( Kempe et all l201l[) . 



In particular, the quantum bounds for the CHSH inequal- 
ity and the chained inequalities (inequality (|23l) in the 
case d = 2) can easily be obtained in this way ( Wehnerl . 
l2006bl ). It should be noted that this SDP technique can 
also be generalized to yield bounds for other Bell inequal- 
ities, however these are usually not tight and could in fact 
be very far from optimal. 

In the d — 2, m = 2, this SDP approach can be used 
to yield a complete description of Q in the correlation 
space C in terms of a finite set of non-linear inequalities: 
a behavior is quantum if and only if satisfies 

|asin(AiBi)+asin(AiB 2 )+asin(A 2 Bi)-asin(A 2 S2)| < t 

(33) 

together with the inequalities o btained by permuting the 
(A x By) in the above expression (ICirel'so dL 119801 : iLandauL 
119881: IM asane sl l2003l : lTsirelsonlll987D . For further results 
and a more detailed discussion of the charact erization of 
Q in the corre l ation space C, we refer to ( Avis et all . 
120091 ITsirelsonl Il987t fl993h . 

It is interesting to note that it is much harder to de- 
termine the optimal local bound Si for a correlation Bell 
inequality t han it is to compu te the quantum one un- 
less P=NP (|Cleve et all 12004 . That is, the quantum 
problem is actually easier than the classical one. 



b. Correlation inequalities. The case of quantum correla- 
tions inequalities defined in the correlation spa ce C is par- 
ticularly well understood th anks to Tsirelson ( Cirel'sonl . 
119801: ITsirelsonl . Il987t Il993l) . Recall that in the corre- 
lation space, a behavior is defined by the m 2 correla- 
tors (A x B y ). It is easy to see that such a behavior 
is quantum if we can write (A x B y ) = (ip\A x <S> B y \ip) 
for some quantum state \ip) in Ha ® Hg and some ±1- 
eigenvalued quantum observables A x on Ha and B y on 
Hb- Tsirelson showed that it is sufficient to consider 
dim Ha = dim Hb = 2™ if m is even and dim Ha = 
dim Hb = 2" l+1 if m is odd, and to be a maximally 
entangled state in Ha <8> Hb- Furthermore, he showed 
that the m 2 correlators (A x B y ) are quantum if and only 
if there exit 2m unit vectors v x and w y in R 2m such that 

(A x B y ) = v x ■ w y (32) 



for all x, y G {1, . . . , m}. This last representation is par- 
ticularly useful, as deciding if a behavior can be writ- 
ten in the form (|32[) can be cast as a semidefinite pro- 
qram (SDP) for which efficient alg orithms are available 
(icieve et aLl . l2004tlWehne"n . l2006bD . This means that the 
problem of computing p wln is in the complexity class EXP 
(exponential time, since SDPs can be solved in polyno- 
mial time b ut the inpu t is of expon ential siz e ). How ever, 
combining (IJain et al I I2010D and flWehnerl . l2006aft one 
now knows that the problem of computing the winning 
probability for XOR games lies in the complexity class 
PSPACE. 

This technique can be used to compute tight bounds 
for 2-outcome correlation inequalities, i.e., XOR-games. 



c. State and measurement dependent bounds. Let us now 
go back to the general case of quantum correlations in the 
probability space V . To compute the quantum bound 
(f3"0"| of a Bell expression, a first simple approach is to 
introduce an explicit parametrization of a family of Bell 
operators S in an Hilbert space H = Ha £3> Hb of fixed 
dimension dimH = d^, and to maximize ||iS|| over all 
operators in this family. In general, however, we have 
no a priori guarantee that the optimal quantum bound 
can be realized using a Bell operator from this particular 
family. Furthermore, most optimization methods cannot 
guarantee convergence to the global extremum. This ap- 
proaches therefore typically only yields lower-bounds on 
S q . It is nevertheless very useful when looking for an ex- 
plicit quantum violation of a Bell inequality s • p < Si 
(though wc will have no guarantee that this is the optimal 
quantum violation). 

Rather than directly trying to obtain a state- 
independent bound by maximizing the norm of the Bell 
operator, it is often easier to compute the quantum 
bound for a fixed quantum state i.e. maximize 
{ip\S\ip) over all Bell operators S. This optimization 
can be dealt with as above by introducing an explicit 
parametrization of a famil y of Bell operators. An ot her 
possibility, introduced in ( Liang and Dohertvl . l2007t ) , is 
to exploit the fact that, for a given quantum state, a Bell 
expression is bilinear in the measurement operators, that 
is, it is linear in the operators {Afa^} for fixed {M^y} 
and linear in the {M^y} for fixed {M a \ x }- When the 
measurements on one system are fixed, the problem of 
finding the optimal measurements for the other system 
can therefore be cast as a SDP. This SDP can then be 
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used as the basis for an iterative algorithm: fix Bob's 
measurements and find Alice's optimal ones; with these 
optimized measurements for Alice now fixed, find the 
optimal ones for Bob; then optimize again over Alice's 
measurements and so on, until the quantum value con- 
verges within the desired numerical p recision. A similar 
iterat ive algorithm was introduced in ([Werner and Woli 
120011) for correlation inequalities. In this case, once the 
measurements for one party are fixed, optimization of 
the other party's measurements can be carried out ex- 
plicitly. This turns out to be true not only for corre- 
lation inequaliti es but for any Bell expre ssion with bi- 
nary outcomes ( Liang and Dohertvl 120071 ). Finally, let 
us note that a method for finding an optimal Bell op- 
erator for a fixed quantum state, can again be used in 
an iterative algorithm to find a state-independent bound 
( Pal and Vertesll2009h : starting with an initial quantum 
state (e.g. a maximally entangled state), find the corre- 
sponding optimal Bell operator; find then the optimal 
quantum state associated to this Bell operator (i.e. the 
eigenvector associated to the largest eigenvalue); repeat 
these steps starting from this new state. 



d. General bounds. The techniques that we just de- 
scribed provide lower-bounds on S q . Looking at ((501) it 
becomes clear that finding S q can be understood as an 
instance of polynomial optimization. More specifically, 
we wish to optimize pip over non-commutative variables 



M a \ x , M b \ y subject to certain constraints, namely that 
such variables form quantum measurements and Alice 
measurement operators commute with those of Bob. It 
is known that in principle any polynomial optimization 
problem in commutative variables can be solved using a 
hierarchy of SDPs — two general metho ds that are dua l 
to e ach othe r have been introduced by ([Lasserr 2 12001b 
and ( Parrilol |2003|) respectively. 

It turns out that these tec hniques can be ex- 
tended to the quantum sett ing ( Dohertv et all 120081 : 
iNavascues et all 120071 120081 ). yielding a powerful ap- 
proach to obtaining upper-bounds on S q , i.e., of deriv- 
ing constraints satisfied by the entire q uantum set. This 
meth od was originally introduced in (|Navascues et all 
l2007t ). which follows the ideas of (|Lasserrell200ll) . The 
idea is basically the following. Let \tp) and {M a \ x }, 
{M(,\ y } define a quantum realization of a behavior p € 
Q', i.e. p{ab\xy) = {ip\M a \ x M b \ y \ip) . Let O be a set 
of k operators consisting of all the operators M a i x and 
M b \ y together with finite products of them. For in- 
stance O may consist of all operators of the form M a \ x , 
A%, M a[x M a , w , M alx M bly , M b \ y M b ,\ y ,. Denote by O t 
(i = 1, . . . , k) the elements of O and introduce the k x k 
matrix T with entries Tjj = (ip\OlOj\tp), called the mo- 
ment matrix associated to O. Then the following proper- 
ties are easily established (independently of the particu- 
lar quantum realization considered): i) V >z is semidef- 
inite positive, ii) the entries of T satisfy a series of linear 
inequalities, Hi) the probabilities p(ab\xy) defining the 



behavior p correspond to a subset of the entries of T. 
A necessary condition for a behavior p to be quantum 
is therefore that there exist a moment matrix T with 
the above properties, a problem that can be determined 
using SDP. For any O, the set of behaviors p € V for 
which there exist such a moment matrix thus define a set 
Qo that contains the quantum set Q! (and thus also Q). 
Optimizing a Bell expression (which is linear in p) over 
this set Qo is also a SDP and yields an upper-bound on 
S q . Consider in particular the case where O is the set 
of all operators consisting of product of at most v of the 
operators M a \ x and M b \ y and denote the corresponding 
set of behaviors Q v . Then the associated SDP defines 
for v = 1,2,... a hierarchy Q\ C Q 2 C ... C Q of re- 
laxations approximating better and better the quantum 
region from the outside (See Fig. [3]). Or, cquivalcntly, 
they define a decreasing series of upper-bounds on the 
quantum bound S q of any Bell expression. 

Subsequently , follow ing the ideas of ( Parrilol 120031) . 
( Dohertv et all 20081) constructed th e SDP hierarchy 
that is dual to ( Navascues et aU^OO^ . It relies on the 



fact that for any Bell operator S we have p = b q t — S > 
(i.e., ||iS|| < b q ) if and only if the polynomial p can be 
written as a (weighted) sum of squares of other polyno- 
mials. We can thus think of minimizing b q such that p 
is a sum of squares of polynomials in order to find ||B||. 
If we limit the degree of these polynomials, the problem 
can be cast as an SDP. Very roughly, at level £ of the 
SDP hierarchy we then limit the degree to be at most 2£, 
leading to better and better bounds for increasing values 
oil 

In (iDohertv et all 120081 : INavascues et all l2008h . it 
is shown that this hierarchy of SDP relaxations con- 
verges in the asympt otic limit to the set Q' (see also 
( Pir onio et al 1 120101J) for a more general approach not 
limited to quantum correlations). It is also possible to 
certify that a behavior p belongs to the quantum set Q 
or obtain the optimal bound S q of a Bell expression at a 
finite step in the hierar chy (see e.g. ( Dohertv et aZ.I . [20081 : 
INavascues et al. I. I2008D for a n umber of examples) . A cri- 
terion has been introduced in ( Navascues et all l2008h to 
determine when this happens and to reconstruct from the 
moment matrix T a quantum realization of this optimal 
solution in term of an explicit state \ip) and operators 
M a i x and M b \ y . Optimality at a finite step in the hi- 
erarchy can also be determined by comparing the SDP 
upper-bound with a lower-bound obtained by search- 
i ng over explicit famili es of quantum Bell operators. In 
(|Pal and Vertesil . l2009t) . for instance, the optimal quan- 
tum value S q of 221 Bell expressions has been determined 
in this way at the 3rd step of the hierarchy. Even if they 
do not always provide an optimal bound, numerical ex- 
amples show that low-order steps of the hierarchy usually 
already approximate very well the quantum bound. In 
( Kempe et all I20T0I ) . it is proven that for a certain par- 
ticular family of Bell scenarios, known as unique games 
(see section III.B.4.c[) , the first step of the hierarchy al- 
ways provides a good approximation of the quantum set. 
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FIG. 3 Hierarch y of sets Q v generated by the hierarchy of 
SDPs defined in (jNavascues et aZl 120071 ) (see text). Each set 
in the hierarchy approximates better the set of quantum corre- 
lations Q. In the CHSH scenario, the set Qi already achieves 
the maximum quantum value of the CHSH inequality, i.e., 
Tsirelson's bound. 



Let us also note that for correlation inequalities, the first 
step of the hierarchy always provides the optimal solu- 
tion as it equivalent to the SDP approaches based on 
Tsirelson results mentioned earlier. 

In the d = 2, m — 2 case, the set Qi corresponding 
to the first step of the hierarchy has b een analytically 
characterized in ( Navascues et aZ.I . l200?t) . A behavior p 



belongs to Qi if and only if (Ai) 2 = 1 or (Bj) 2 = 1 for 
some i, j — 1, 2 or if it satisfies the inequality 

|asin(£)n) +asin(Di2) +asin(Z?2i) — asin(Z?22) | < t (34) 

together with the inequalities obtained from this one by 
permuting the D^-, where 



Especially in the case of multipartite correlations it is 
sometimes convenient to compute S ns to obtain a (crude) 
bound for S q . 

Finally, let us remark that since MS is defined by a 
finite number of linear inequalities, it is, as the local set, 
a polytope and can also be described as the convex hull 
of a finite set of vertices. These can be obtained from 
the list of facets (the inequalities p(ab\xy) > 0) using 
the same polytope algorithms that allow one to list the 
facets of C given its vertices. The vertices of £, the local 
deterministic points d A , are clearly also vertices of MS 
(since they cannot be written as a convex combination of 
any other behavior). All other vertices of MS are non- 
local. 

The geometry of the no-signaling set and its relation 
to C is particularly simple for the d = 2,m = 2 Bell 
scenario. In this case, the no-signaling behaviors form 
an 8-dimensional subspace of the full probability space 
V . The local polytope consists of 16 vertices, the local 
deterministic points, and 24 facets. 16 of these facets 
are positivity inequalities and 8 are different versions, 
up to relabeling of the inputs and outputs, of the CHSH 
inequality. The no-signaling polytope, on the other hand, 
consists of 16 facets, the positivity inequalities, and 24 
vertices. 16 of these vertices are the local deterministic 
ones and 8 are non-local vertices, all equivalent up to 
relabeling of inputs and outputs to the behavior 



(AjBj) — (Aj)(Bj) 

y/(l-(Atf)(l-(Bjy' 



(35) 



p(ab\xy) 



1/2 if a © b = xy 
otherwise 



(36) 



The non-linear inequalities (|34p thus form a necessary 
condition for a behavior to be quantum. They strengthen 
the inequalities (|33|) to which they reduce when (Ai) = 



(Bj) = 0. 



2. No-signaling bounds 



Let us now consider the problem of computing bounds 
on Bell expressions for no-signaling correlations. Con- 
trary to the case of local and quantum correlations, this 
turns out to be a rather easy task. To understand why 
note that, as already mentioned, once the no-signaling 
constraints ([TJ are taken into account, e.g. by intro- 
ducing a parametrization of the relevant affine subspace 
R , the set MS of no-signaling behaviors is uniquely 
determined by the set of d 2 m 2 positivity inequalities 
p(ab\xy) > 0. Deciding whether a behavior belongs to 
MS can thus easily be verified by checking that all pos- 
itivity inequalities are satisfied. Since there are d 2 m 2 
such inequalities, this is a problem whose complexity 
scales polynomially with the number of inputs and out- 
puts. More generally, linear programming can be used to 
determine efficiently the no-signaling boun d S ns of an ar - 
bitrary Bell expression s, as used, e.g. in ( Tonerl . [20091) . 



which is usually referred to as a PR-box. It is easily 
verified that the PR-box violates the CHSH inequality (|4]) 
up to the value s ■ p = 4 > 2, the algebraic maximum. In 
the language of games, this means that the CHSH game 
can be won with probability p^m = 1- There exists 
a one-to-one correspondence between each version of the 
PR-box and of the CHSH inequality, in the sense that 
each PR-box violates only one of the CHSH inequalities . 
The PR-box was introduced in ( Khalfin and Tsirelsonl . 
119851: iPopescu and Rohrlichl 119941 iRastalll Il985h . Since 
the maximal quantum violation of the CHSH inequality 
is 2v2, it provides an example of a no-signaling behavior 
that is not quantum, implying that in general Q ^ NS. 
The relation between £, Q, and MS in the d = 2, m = 2 
case is represented in Figure |4] 

The complete list of all no-signaling vertices is also 
known in the case o f two inputs (m = 2) a nd an arbitrary 
number of outputs (jBarrett et al\ . l2005bf ) and in the case 
of two outputs (d = 2 ) and an arbitrary number o f input s 
( Barrett and Pironi <J 120051: [J ones and Masanesl l2005h . 
In both cases, the corresponding non-local vertices can 
be seen as as straightforward generalizations of the PR- 
box. 
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FIG. 4 A two-dimensional section of the no-signaling poly- 
tope in the CHSH scenario (m = d = 2). The vertical axis 
represents the CHSH value S, while the horizontal axis rep- 
resents the value of a symmetric version of the CHSH expres- 
sion 5". Local correlations satisfy \S\ < 2 and \S'\ < 2. The 
PR box is the no-signaling behavior achieving the maximum 
CHSH value S=4. Tsirelson's bound corresponds to the point 
where S = 2y/2, i.e., the maximum CHSH value that a quan- 
tum behavior can achieve. 



D. Multipartite correlations 

Though we focused for simplicity in the preceding sec- 
tions on Bell scenarios involving n = 2 systems, most of 
the above definitions and basic results extend straight- 
forwardly to the case of an arbitrary number n > 2 of 
systems. For instance, in the tripartite case a behavior 
p(abc\xyz) is no-signaling when 



equalities with m = 2 inputs have been derived in 
(IWerner and Woli |2002[ ) and (jZukowski and Bruknerl . 
120021) for an arbitrary number n of parties. There 
are 2 2 such inequalities (with redundancies under re- 
labelling) which can be summarized in a single, but non- 
linear inequality. Notable inequalities that are part of 
this fami l y are the Mermin inequalities intr oduced in 
(iMerminl Il990a|) and further dev eloped in ( Ardehahl 
Il992t iBelinskii and Klvshkol Il993h . In the case n = 3, 
the Mermin inequality takes the form 

K^xSaCa) + (A 2 B 1 C 2 ) + {A 2 B 2 C 1 ) - (A&d)] < 2. 

(38) 

It is associated to the GHZ paradox (see section III.E[) 
in the sense that correlations that exhibit the GHZ 
par adox violate it up to the algebraic bound of 4. 



xyz) 



p(abc'\xyz') Vo, b, x, y, z, z' (37) 



par adox violate it up to the algebraic bound ot 4. 
In (IWerner and Woli l2002ri the structure of the quan- 
tum region in the full correlation space is also inves- 
tigated. In particular, it is shown that the quantum 
bound of all inequalities introduced in ( Werner and Woli 
l2002t IZukowski and Bruknerl l2002h is achieved by the n- 
partit e GHZ state (|00 ... 0) + [11 . . - l))/v / 2- 

In ( Sliw 1 l2003h . all facet Bell inequalities (in the 
full probability space) have been derived for 3 parties 
in the case d = 2, m = 2. There are 46 inequivalent 
such inequalities. All of these are violated in quan- 
tum mechanics , excep t for the inequality consi dered in 
(jAlmeida et all l2010al) . In (jPironio et all 120 111) , all ver- 
tices of the no-signaling polytope corresponding to the 
same Bell scenario have been listed. Interestingly, they 
are also 46 ineq uivalent classes of no-signaling vertices. 
In (jFritd . l2012bt) . it is shown that there actually exists a 
bijection between facet Bell inequalities and no-signaling 
vertices for every Bell scenario with two inputs and out- 
puts, independently of the number of parties. 

Evidently, the structure of non-local correlations is 
much richer (and less understood) in the multipartite 
case than in the bipartite one. In particular, there ex- 
ist different definitions of non-locality that refine the 
straightforward extension of the bipartite definition. 
This question and other ones that are more specific to 
the multipartite scenario are discussed in section [VTl 



and similar relations obtained from permutations of the 
parties; a behavior is local if it can be written as a convex 
combination of a finite number of deterministic behaviors 
d x (abc\xyz)] Bell inequalities correspond to faces of the 
corresponding polytope; and so on. Below, we discuss 
a few notable results obtained in the multipartite case. 
Note that many references cited in the previous subsec- 
tions also contain results for more than 2 parties. 

As in the bipartite case, one can consider Bell inequal- 
ities that involve only full correlators in the case where 
all measurements have binary outcomes. In the n = 3 
case, for instance, such an inequality would involve only 
terms of the form (A x B y C z ) = b c _ ±1 abcp(abc\xyz), 
and similarly for more parties. All correlation Bell in- 



E. Nonlocality without "inequalities" 

To demonstrate that some quantum correlations p are 
non-local it is sufficient, as discussed in the previous sub- 
sections, to exhibit a Bell inequality that is violated by 
p. In certain cases, however, it is possible to show di- 
rectly that quantum predictions are incompatible with 
those of any local model via a simple logical contradiction 
that does not involve any inequality (though such argu- 
ments can obviously always be rephrased as the violation 
of a Bell inequality). Here, we present two examples of 
such "Bell's theorem without inequalities" , namely the 
Greenbcrgcr-Horne-Zeilinger paradox and a construction 
due to Hardy. 
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The sit uation considered by (iGreenberger et aZ.I. Il989h 
(see also ( Greenberger et aLl . ll990l : Merminl . 1990bl )) in- 
volvcs three players Alice, Bob and Charlie. Each player 
receives a binary input, denoted by Ai, Bi, and Cj, with 
i = 1,2. For each input, players should provide a binary 
output: ±1. With a slight abuse of notation, let us de- 
note by Ai = ±1 the answer to the question "Ai" and so 
on. Suppose that the players share a state of the form 
\GHZ) = 4=(|000) + |111», and when they receive input 
'1' ('2') they perform a local Pauli measurement a x (cr y ). 
It is not difficult to see that their measurement outcomes 
will always satisfy the following relations 



the form 



|^> = a(|01) + |10))+/?|00>, 



(40) 



A1B1C1 = 
A1B2C2 = 
A2B1C2 = 
A2B2C1 



■ I 

-1 
-1 

-1 



(39) 



Let us contrast these quantum predictions with those of a 
local model, where the answer of each party only depends 
on his question and of some shared random data A. Since 
the products of all the questions in (|3"9"]) is either exactly 
+1 or —1, each answer must clearly be a deterministic 
function of the local question and of A, otherwise the 
model would not be able to reproduce the perfect corre- 
lations of (|3"9"|) . For fixed A, a local model thus amounts 
to assigning a definite value ±1 to all of the variables Ai, 
Bi, and Ci. But then this is in direct contradiction with 
the conditions ([59")) . To see this, consider the product of 
all four left- hand-side terms. Since A\ = Bf = Cf = 1, 
this product is necessary equal to 1, but the product of 
the right-hand side is —1. This argument demonstrates 
in a simple way the incompatibility between the predic- 
tions of quantum theory and those of any local model. 

Note that the above GHZ paradox can be recasted as 
the violation of Mermin's inequality, given in Eq. (|38[) . 
i.e., the GHZ correlations (|39p violate the inequality 
(|38[) up to its algebraic maximum 4. In the language 
of non-local games, it provides an example of a game for 
which there exists a quantum strategy that wins it with 
probability p w i n = 1 (see section III.B .4[) . GHZ para- 
doxes of the above types are also know n as "pseudo- 
telepathy game s" dBrassard et all. 2005) or "Kochen - 
Specker games" (|Mermid . ll993tlRenner and Woli \2004 . 
Other multipartite GHZ-type paradoxes, as well as a 
more detailed discussion of the nonlocal correlations of 
GHZ states can be found in section IVI.D.2I Notable 
examples of "nonlocality proofs without inequalities" of 
the GHZ-type but in th e bipa rtite case are presented in 
(lAravindl . 120021 : ICabellol . l200lh . 

Another interesting demonstration of q uantum nonlo- 
cality without inequalities was given in ()Hardvl . I1993D . 
Here the contradiction requires partial entanglement; in 
fact, it can be obtained if and only if the state is not max- 
imally entangled. This represents one of the first hints 
that entanglement and nonlocality are not monotonically 
related (see section HlI.A.7|) . 

Consider a partially entangled state of two qubits of 



where 2\a\ 2 + \/3\ 2 = 1. Alice and Bob can both perform 
two possible measurements. For the first measurement of 
both parties (denoted Ai and Bi), the outcome '+1' is 
associated to a projection on the state cos 6* |0) +sin0 |1). 
The second measurement of both parties (denoted A2 
and B2) is in the logical basis: outcome '+1' is associ- 
ated to a projection on |0), and outcome '-1' with |1). 
By setting a = —f3 cot 6, wc ensure that when Alice 
performs the first measurement and obtains the result 
'+1', Bob's particle is projected in the state |1). Hence 
if Bob then performs the second measurement, he will 
always find outcome '-1'. Similarly, if Bob performed the 
first measurement and found outcome '+1', then Alice 
will always find '-1' when she performs the second mea- 
surement. When both parties perform the first measure- 
ment, the pair of outcomes (+1, +1) occurs with a strictly 
positive probability. Finally, when both parties perform 
the second measurement, the pair of outcomes (—1, —1) 
never occurs. It is then not difficult to see that a local 
model can never reproduce the above series of events, 
which form a frustrated chain. Generalizati ons and dis- 
cussions on Hardy's paradox can be found in (|FritzLl20i:H 
iRabelo et al\ . 12012ft and references therein. 



F. Quantifying nonlocality 

So far, we have mainly discussed the problem of de- 
tecting nonlocal correlations, i.e. determining whether 
given correlations p belong to £ or not. Another rele- 
vant question is how to quantify nonlocality. 

A choice that is often made to quantify non-locality 
is through the amount of violation of a Bell inequality, 
i.e., p is more non-local than q if s • p > s ■ q for some 
Bell expression s. The problem with this approach is 
that there can be another Bell expression s' such that 
s' • q > s' • p. Another problem is that a given Bell 
inequality can be written in many equivalent ways using 
the normalization conditions 1-p = Yl a bxyP( a ^\ x y) = in 2 
(we recall that m denotes the number of possible inputs 
x and y). Let for instance s be the CHSH expression 
dU), for which Si = 2 and S q = 2y/2. Consider the Bell 
expression s Q = a s+(l— a)/2 1 obtained from the CHSH 
expression through irrelevant rescaling and addition of an 
offset. For any p, we thus have s a ■ p = a s • p — 2a + 2, 
which implies that the local bound Sf = 2 of the new Bell 
expression is identical to the one of the original CHSH 
expression, but now its maximal quantum violation S q = 

2 + 2(y(2) — l)a can artificially be made arbitrarily large 
for increasing a. 

If the amount of violation of Bell inequalities is used 
to quantify non-locality, this amount of violation must 
thus first be normalized in some proper way. If this nor- 
malization is well-chosen, one can then often relate the 
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amount of violation of Bell inequalities to an operational 
measure of non-locality. 

Possible operational measures of non-locality are sim- 
ply given by the tolerance of non-local cor relations to 
the addition of noise 



such as white noise (1 Acin et al 
20021; iKaszlikowski et al\. 2000h . local noise ( Junge et at. 
2010bl;|P erez-Garcia et al\. 120081) , or de tection inefficien- 



cies (jMassarl 120021; iMassar et a71 1200 



is shown in ( Perez-Garcia et al\ , 1200 



. In particular, it 
(see also section 
IIII.B.2[) that the tolerance of p to any local noise, defined 
as the minimal value of r such that rp + (1 — r)q ^ C for 
all q <E C, is given by r = 2/(V + 1) where v is the max- 
imal possible violation by p of a Bell inequality, defined 
in the following way 



v = max ■ 



s-p| 



max qe£ |s • q| 



(41) 



Note that taking the ratio and the absolute value is cru- 
cial for a meaningful definition of this amount of viola- 
tion. If instead of the ratio, one takes for instance the 
difference, a change of scale s — > 7s would lead to ar- 
bitrary violations. If one removes instead the absolute 
value, the same happens via an offset, as in the example 
discussed above. 

Another operational measure of the non-locality of cor- 
relations p is given by the amount of classical communi- 
cation between the two wings of the Bell experiment by 
which a local model has to be supplemented for rcpro- 
ducing these correlations . This approach was adopted in 



(iBacon and Tonerl. 2003 ; Brassard et al. |jl99dlMaud"lm] . 
ll992l ; ISteinerlj2000l;lTo"ner and Bacorll2003l) (see also dis- 
cussion in section ITTT7C]) . In (iPironiol 120031 ). it is shown 



that any Bell expression s can be rewritten in a normal- 
ized form s* - through an appropriate change of scale 
and an offset - such that the minimal average amount of 
classical communication C(p) necessary to reproduce p is 
given by C(p) = max s * s* • p. Techniques for estimating 
the communication complexity of arbitrary no-signaling 
correlations and t heir relation to Bell v iolations are fur- 
ther developed in ( Degorre et all l201l[ ) . 

Finally, a third proposed approach to measure 
nonlocality is thro ugh its "statistical strength" 
( van Dam et aZl. l2005h : that is, the amount of confidence 
that the measurement outcomes of n independent Bell 
experiments governed by a behavior p could not have 
been reproduced by a local behavior. Indeed, statistical 
fluctuations on a finite sample allow for the possibility of 
apparent Bell inequality violations even by a local model 
(this issue for the interpretation of Bell experiment 
results is specifically discussed in section IVII.B.3|) . In 
an experiment, the goal is to test in a finite number 
of trials whether the system is governed by the laws of 
quantum mechanics (hypothesis QM), or whether the 
system obeys a Bell local model (hypothesis LOC). The 
statistical tool that quantifies the asymptotic average 
amount of support in favor of QM against LOC per 
independent trial i s the Kullback-Leibler (or relative en- 
tropy) divergence ( van Dam et aUl2005l ). This quantity 



can be see as a distance D(p) between a given behavior 
p and the set of local behaviors. 

The statistical strength of the mo st common nonlo- 
cality tests have bee n estimated in ( Acin et~al\ . 120051 : 
Ivan Dam et all l2005h and are summarized here in Ta- 
ble |U It is worth to note that CHSH scenario is the 
strongest test among b ipartite Bell tests involving qubits 
( van Dam et Ml, l2005ri . When, c onsidering higher di- 
mensional systems, optimal tests ( Acm et al\ . |2005|) in- 



volve partially entangled states (rather than maximally 
entangled ones), illustrating the astonishing relation be- 
tween entanglement and nonlocality (see section lIlLA.7p . 
Finally, the Mcrmin-GHZ test (sec section Hl.D[) . involv- 
ing three qubits appears to be much stronger than the 
considered bipartite Bell tests ( van Dam et all 120051 ) . 



TABLE I Kullback-Leibler divergence for the most common 
quantum Bell tests. ME stands for maximally entangled. 



Bell inequality 


Quantum state 


KL divergence (bits) 


CHSH 


ME 2-qubit 


0.046 


CGMLP 


ME 2-qutrit 


0.058 


CGMLP 


Optimal 2-qutrit 


0.077 


Mermin-GHZ 


GHZ 


0.208 



G. Multiple-rounds and parallel repetition 

So far, we have characterized the predictions p = 
{p(a, b\x, y)} of local, quantum, or no-signaling systems 
in single-round Bell experiments where a single choice of 
input pair (x, y) is made and the two devices produce 
a single output pair (a, b). More generally, we can also 
consider multiple-round Bell experiments in which a se- 
quence (x-y, yi), . . . , (x n ,y n ) of input pairs is used in the 
two devices, resulting in a sequence (ay, by), . . . (a n , b n ) 
of output pairs. A physical model for such an experi- 
ment will thus be characterized by the joint probabili- 
ties p„ = {p(aybi . . . a n b n \xiyi . . . x n y n )}. The motiva- 
tion for considering such multiple-round Bell scenarios is 
clear: it corresponds to the situation of real experimen- 
tal tests of Bell inequalities in which the two quantum 
devices are probed many times to gather sufficient mea- 
surement statistics. 

Three general multiple-rou nd scenarios can be distin- 
guished ((Barrett et q/l l2002h . First, the n output pairs 



can be obtained by measuring n independent 6 systems. 
Effectively, this means that the measurement settings are 
applied sequentially, i.e., the next input pair is intro- 
duced in the two devices after outcomes have been pro- 



6 Note that we allow though some correlations between the dif- 
ferent systems through some global shared randomness A, see 
definition of further in the text. The n systems are only 
independent with respect to sharing the inputs and outputs. 
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duccd for the previous round, and furthermore the de- 
vices have no memory of the previous round. In this sce- 
nario, we say that p„ is local, which we denote p„ G CJ nl if 
p{aibx...a n b n \x 1 yx...x n y n ) = J2\ qxPi(aibi\xiy!, A) x 
... x p n (cL n b n \x n y n , A) (similar definitions apply to the 
quantum and no-signaling case). 

In the second scenario, the measurement settings are 
also applied sequentially, but the devices' behavior in a 
given round can depend on the previous measurement 
settings and outputs, i.e., the devices have memory of 
the previous rounds 7 . In this case, we say that p„ 
is local, which we denote p„ € Cf/ , if we can write 
p{a 1 bi...a n b n \x 1 yx...x n y n ) = Y,\q\Pi{aibi\xiyi,\) x 
p 2 (ci2&2 1^22/2, wi, A) x . . . xp n (a n b n \x n y n , w n -x,X), where 
wi = (ai&i . . . a,ibi, X\y\ . . . x n y n ) denotes all inputs and 
outputs up to round i. This situation is the most gen- 
eral one that characterize usual experimental tests of Bell 
inequality. 

Finally, we may also consider a third scenario in 
which Alice and Bob apply all their n inputs at the 
same time, and the device produce all n outputs at 
the same time. We then say that p„ is local, which 
we denote p„ e £f , if p(aih . . . a n b n \xiyi . . . x„y n ) = 
J2x Q\P( a ibi ■ ■ ■ a n b n \xiyi . . . x n y n ,\). In this case, the 
devices can exhibit a collective behavior where the out- 
puts cii of Alice's device at round i depends on the val- 
ues of inputs and outputs of her device at any other 
round, and similarly for Bob's device. This multiple- 
round model is formally equivalent to a single-round 
model with "big" inputs x = x\ . . . x n and y = y\ . . . y n 
and outputs a = a% . . . a n and b = b\ . . . b n . 

The memory model and the simultaneous model 
are strictly more powerful than the independent 
model CJ: n . It is easy to see though that local strategies 
exploiting such memory or c ollective effects canno t repro- 
duce non-local correlations ( Barrett et all [20021 ). which 
necessarily require some genuine non-local resource, such 
as an entangled quantum state. 

An other potential problem though is that in experi- 
mental tests, the correlations p„, which characterize the 
probabilities with with different set of events can be real- 
ized, are not directly observable. Instead one observe a 
finite number of events, representing only one particular 
realization of the set of possibilities encoded in p„ . If the 
local models or £f cannot reproduce non-local 

correlations on average, it could nevertheless be possible 
that clever choices of such multiple-round stragegies, in 
particular exploiting memory or collective effects, could 
increase the chance of a statistical fluctuation resulting in 
an apparent violation of a Bell inequality above its local 



bound. In the case of the independent and memory mod- 
els, which are the most relevant to experimental tests and 
to applications of quantum non-locality, such statistical 
fluctuations are harmless and can easily be controlled 
( Barrett et 

Mi l2002t iGiiJ . l2003h . See section IVII.B.3I for 

a more detailed discussion. This is due to the fact that at 
any given round i, independent and memory local models 
are constrained to satisfy the Bell inequalities, even when 
conditioning on events up to round That is if Pi\ Wi _ 1 
denote the correlations at round i conditioned on the past 
variables we necessarily have s-pji Wi _ 1 < Si for ev- 

ery i, Wi-i, and Bell expression s. 

This last property can nicely be rephrased in the lan- 
guage of non-local games. It implies that to win n in- 
stances of a game, there is no better strategy than using 
each time the strategy that is optimal for a single-round. 
This is not the case in the simultaneous model, where 
all inputs are given at the same time and all output pro- 
duced at the same time. In this case, which is known 
as a parallel repetition of the game in computer science, 
there may exist collective strategies to win n instances 
of the games that are better than using each time the 
optimal single-round strategy. It is in fact known that 
for example the CHSH g ame can be played be tter locally 
over many rounds (see (jBarrett et a/.L l2002h for an ex- 



7 Formally, we consider here two-sided memory models, where each 
device has a memory of every previous input and output, includ- 
ing those of the other device. One can also consider one-sided 
memory models, where each device has only a memory of the in- 
puts and outputs relative to his side o f the Bell experiment but 
not the other one jBarrett et all\200$) ). 



plicit example in the case n = 2), that is, when playing 
the CHSH game many times in parallel the gap between 
the local and quantum bound shrinks. 

The question of whether there exists a better strategy 
for parallel repetition of the game is particularly inter- 
esting from the pe rspective of computer science, see e.g. 
( Cleve et al 1 I2007D . It also tells us something about the 
strength of correlations between physical systems when 
Alice and Bob hold many particles to be measured simul- 
taneously. 

Note that if there exists a strategy that lets the play- 
ers win with probability p w i n in a single round, then they 
can win with probability p^ in when playing the game 
£ times. The question is then whether there exists a 
strategy that beats this value. We speak of (strong) par- 
allel repetition if there exists a non-trivial q such that 
the winning probability when playing the game I times 
always upper bounded by q l . The term perfect paral- 
lel repetition refers to the case whe re q = p w j n . For 
no-signaling strategies, it is known (jHolensteinl . |2007[ ) 
that parallel repetition holds. As quantum and classi- 
cal theory obey the no-signaling principle this also gives 
a bound for quantum and classical correlations. Yet, 
since for many games (e.g. unique games) we have 
Pwin = 1 in the no-signaling case, this bound is not 
always useful. For quantum correlations, it is known 
that for XOR-games (2-outcome corr elation Bell inequa l- 
ities) perfect parallel repetition holds (|Cleve et all 120071 ). 
Again, this also gives a bound for classical correlations, 
but already for the CHSH game it is not known how tight 
this bound actually is. Parallel repetitio n in the quan- 
tum setting also holds for unique games ( Kempe et all 
I2010H . A more general result is known for quantum cor- 
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relations ( Kenxpe and Vidickl . 1201 lbh . however, requires 
the game to be modified slightly to include "check" 
rounds. A s imilar constructi o n can be made for local 
correl ations dFeige and Kilianl l200fl . For local correla- 
tions, 1998) has shown that parallel repetition does 
hold. 



III. NONLOCALITY AND QUANTUM THEORY 

In this section, we analyze the quantum resources — in 
terms of entanglement or Hilbert space dimension — that 
are necessary to produce nonlocal correlations by per- 
forming local measurements on quantum states. Here we 
focus on the case of bipartite states, whereas the non- 
local correlations of multipartite quantum states will be 
discussed in Section IVTl 



A. Nonlocality vs entanglement 

In order to obtain nonlocal correlations from measure- 
ments on a quantum state, it is necessary that the latter 
is entangled. That is, the state cannot be written in the 
separable form 



k 



(42) 



Indeed, if state is of the above form, the correlations ob- 
tained by performing local measurements on it are given 
by 

p(ab\xy) = tr ^ Pk (p k A ® p k B ) M a]x ® M b]y 

. k 

= Y,Pk^{PAM a \ x )tv(p k B M bly ) 

k 

= ^2pkP(a\x,k)p(b\y,k), (43) 



1 19911 ; iHome and Selleril Il99lh 9 . Therefore, all pure en- 
tangled states are nonlocal. The only pure states that 
do not violate Bell inequalities are the product states 
|*) = \ip) A ®\4>) B - 

For mixed states, it turns out that the relation between 
entanglement and nonlocality is much m ore subt l e, and 
in fact not fully understood yet. First, ( Werner! Il989l ) 
discovered a class of mixed entangled states which admit 
a local model (i.e. of the form (fTTj) ) for any possible lo- 
cal measurements. Hence the resulting correlations can- 
not violate any Bell inequality. While Werner considered 
only projective measurements, his results where later ex- 
tend to t he ca se of general measurements (POVMs) in 
(|Barrettl . [2002l) . 

The situation is complicated by the fact that directly 
performing measurements on a mixed state p is not al- 
ways the best way to reveal its non-locality. For instance, 
it may be necessary to perform joint measurements on 
several copies of the state (that is considering the state 
p®p®. . Mp). Alternatively one may need to apply a judi- 
cious pre-processing to p, for i nstance a filterin g, before 
performing the measurements ( PopescvJ , Il995l ). There- 
fore, there exist different possible scenarios for revealing 
the nonlocality of mixed entangled states, some examples 
of which are represented in Fig. [5j which will be discussed 
in more details below. Importantly a state may lead to 
nonlocal correlations in a given scenario but not in oth- 
ers. It is also worth mentioning that when many copies 
of a state can be jointly pre-processed before the mea- 
surements, the problem becomes closely related to entan- 
glement distillation. Indeed, any state from which pure 
bipartite entanglement can be distilled will lead to nonlo- 
cality. For undistillable (or bound) entangled states, it is 
not yet known whether Bell inequality violations can be 
obtained, or w hether these states admit a local model, as 
conjectured in ( Peresl [l999l) . Nevertheless, recent results 
suggest that non locality might in fact be generic for all 
entangled states (jMasanes et al ]. l2008l) . 



which is of the local form (fTTj) , Hence the observation 
of nonlocal correlations implies the presence of entangle- 
ment. 

It is interesting to investigate whether this link can be 
reversed. That is, do all entangled state lead to nonlo- 
cality? In the case of pure states, the answer is posi- 
tive. Specifically, for any entangled pure state, it is pos- 
sible to find local measurements such that the result- 
ing correlations violate a Bell inequality 8 , in particular 
the CHSH inequali ty. This was shown for the case of 
two-qubit states in ( Capasso et al\ . Il973l ) and for bipar- 
tite states of arbitrary Hilbert space dimension in (iGisinl . 



Note t hat this result also holds for al l multipartite pure entangled 
states l|Popescu and Rohrlichl. 1 1992ft . as discussed in more detail 
in section IVll 



1. Single-copy nonlocality 

The simplest possibility to reveal nonlocality of an en- 
tangled state p is to find suitable local measurements such 
that the resulting correlations violate a Bell inequality. In 
the case of a pure state this is always sufficient to reveal 
nonlocality. In particular, as mentioned above, all pure 
entangled states violate the CHSH inequality. For mixed 
states, a necessary and sufficient condition for any two- 
qubit state to violate t he CHSH inequality was given in 
(|Horodecki et «Ull995l) . This criterion works as follows. 
Associate to any two-qubit state p a correlation matrix T p 
with entries = tr [p{ai ® <Jj)] for i,j = 1,2,3, where 



9 Note that this re sult was also s tated, without giving an explicit 
construction, in llWe: 
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FIG. 5 The nonlocality of a quantum state p can be revealed in different scenarios, a) The si mplest scenario : Alice and Bob 
directly perform local measurements on a single copy of p. b) The hidden nonlocality scenario (|Popescd.ll995h : Alice and Bob 
first apply a filtering to the state; upon successful operation of the filter, the perform the local measurements for the Bell test, 
c) Many copies scenario: Alice and Bob measure collectively many copies of the state p. d) Network scenario: several copies of 
p are distributed in a quantum network, where each observers performs local measurements. 



<7j are the Pauli matrices. The maximum CHSH value 
S for p (considering the most general measurements) is 
then simply given by 



S p = 2 



'22i 



(44) 



where mn and 777,22 are the two largest eigenvalues of the 
matrix T p Tj {Tj denotes the transpose of T p ). Using the 
above criterion, it is possible to relate the entanglement 
of p, as measured by its concurrence, to its m aximal vi- 
olation of CHSH (jVerstraete and Woli |2002[ ). 

From the above criterion it is straightforward to see 
that not every entangled two-qubit mixed state violates 
the CHSH inequality. However, contrary to the case 
of pure states, it is here not enough to focus on the 
CHSH inequality. In particular, there exist two-qubit 
states which do not violate CHSH, but violate a more so- 
phisticated Bell inequality (-Z3322, see eg. (1211)) involvin 



three measurement settings per party ([Collins and Gisinl . 
2004). Another example is the two-qubit Werner state, 
given by a mixture of a maximally entangled state \4>+) = 
(| 00) + 1 1 1) )/ a/2 and the maximally mixed state, i.e.: 



pw =p\4>+) (0+1 + (1 -p)-£- 



(45) 



This state is separable for p < 1/3 (and thus does not vio- 
late any Bell inequality) and entangled otherwise. Using 
the criterion (|4"4"|) one finds that S = p2y/2, which leads 



to violation of C HSH for p > 1/^2 « 0.707. However, 
it was shown in (IVertesil . l2008h that the state (|45|) vio- 
lates a Bell inequality involving 465 settings per party 
for p > 0.7056. 

If explicit Bell inequality violations yield upper-bounds 
on the critical value of p necessary to reveal the nonlo- 
cality of the state (|4"S"]) , it is also possible to obtain lower- 
bounds on p by c onstruct i ng ex plicit local models. In 
a seminal paper, ( Wernerl . Il989| ) showed that the cor- 
relations resulting from projective measurements on the 
state (|4"5|) admit a local model if p < 1/2, even though 
the state is entangled for p > f/3. Entangled states 
admitting a local model are usually termed local states. 
Here we describe Werner 's model, following the presen- 
tation of ( Popescul . If 9941 ) . Note first that it is sufficient 
to construct a local model for p = 1/2, since then the 
model can be extended for any p < 1/2 by mixing with 
completely uncorrelated and random data. Let Alice and 
Bob measure the spin polarization of their particles in the 
fiA and n b directions respectively, where vectors describe 
the measurements on the Bloch sphere. The probability 
that both Alice and Bob get the outcome '0' is given by 

p(00\n A ,n B ) = ^(1 - i cosa), (46) 

where a is the angle between Ha and fis- Now, we give 
a local hidden variable model that gives the same statis- 
tics. Let the hidden variable be a vector on the Bloch 
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sphere A = (sin 9x + sin 9 sin <f>y + cos Qz) known to both 
Alice and Bob. In each run of the experiment a different 
A is sent, chosen according to the uniform distribution 
dp(X) = sin t )d9d(f)/A'K. After receiving A, Alice gives the 
outcome '0' with probability 



p A (0, fiA, A) = cos 2 (a j4 /2), 



(47) 



where a a is the angle between fiA and A. At the same 
time, Bob gives the outcome '0' with probability 



Ps(0|n B , A) = 



1 if 2cos 2 (a B /2) < 1, ( 
if 2cos 2 (a B /2) > 1, 1 8j 



where a B is the angle between n B and A. Now one can 
check that the joint probability distribution obtained by 
Alice and Bob using this local model is given by 

PLHv(00\nA,n B ) = J dp(\)pA(0,hA,X)PB(0,n B ,X), 

(49) 

which is indeed equal to (|46[) . It is straightforward to 
check that the above model reproduces the desired cor- 
relations for all measurements outcomes. 

Werner's model was later improved to p < 2/3 in 
(|Acm et qZ.l . l2006bl) . using a connection to Grothcndicck 
constant (see Section lHI.B.2p . Furthermore, the result of 
Werner was extended to the most general quantu m mea- 
surements (so-called POVMs) in (jBarrettl . l2002h . where 
a local model was given for p < 5/12 0.48. Hence, for 
the interval 2/3 < p < 0.7056 (or 5/12 < p < 0.7056 if 
considering POVMs) it is not known whether the non- 
locality of the state (|4"5|) can be revealed by performing 
measurements on a single copy of the state. 

Werner and Barrett also derived a local model for a 
family of states generalizing the two-qubit state (|4"5"T) to 
arbitrary Hilbcrt space dimension d. These arc called 
Werner states, given by 



2-Panti v 1 



(50) 



where 1 is a d 2 x d 2 identity matrix, and P a nti denotes the 
projector on the antisymmetric subspace. These states 
have a particular symmetry, being invariant under oper- 
ations of the form U ®U . The values of a for which pw 
is entangled and admits a local model (for projective or 
general measurements) are summarized in Table [TT1 

The above local m odel s were further exten ded in 
(jAlmeida et all 120071) and (jWiseman et all 120071) to an- 
other family of states generalizing the two-qubit state 
(1431) . namely the isotropic states 



p Uo =p\$ + ){*+\ + Q.-p) 



d 2 ' 



(51) 



where |$+) = (l/Vd) ^2^ = q \ii) is a maximally entangled 
state of local dimension d. Again, for such states there 
exist a range of the parameter p for which p iso is entan- 
gled but admits a local model (see Table HI)) . Note also 



that pi SO violates the CGLMP inequality (see eq. (|22j) ) 
when p is above a critical value, Pnl, that decreases with 
the loca l dimension d . In pa rticular pnl 0.67 when 

d -> oo ((Collins et all l2002a| ) (see Fig®. 

More generally, the approach of ( Almeida et all l2007f ) 
allows one to construct a local model for general states, 
of the form p = p\ip) (ip\ + (1 — p)l/d 2 , where \tp) is 
an arbitrary entangled pure state in C d <g> C d . It is found 
that for p < 6( lo ^ d - ) ) the state p admits a local model for 
projective measurements. Interestingly there is a log(<i) 
gap in the asymptotic limit between the above bound and 
the separability limit, which is given by p < 0(l/d 2 ). An 



uppe r bound on p follows from the result of ([Acm et all 
l2002h - where it is shown that a state of the form p = 
p\<j)+) (<j)+\ + (1 —p)-Jp (where \<f> + ) denotes a two-qubit 
maximally entangled state) violates the CHSH inequality 
for p > Q( ^g. 4 ), which tends to zero when d — > oo. 

This shows that there exist entangled states embedded 
in C d <E> C d which are highly robust against white noise. 

Fi nally, it is w orth pointing out the connec- 
tion (jWernerl . Il989h between the fact that a quantum 
state admits a lo cal model and the ex istence of a sym- 
metric extension ( Dohertv et al ., 2002) for this state. A 
bipartite state pab has a fc-symmetric extension (i.e. a 
k— symmetric extension with respect to part B) if there 
exists a quantum state of k B + 1 parties, p'j LBl Bk , such 
that p' AB . = Pab for every i = l,...,fc, where p' AB . de- 
notes the reduced state of subsystems A and Bi. In 
( Terhal et al I l2003h it is shown that if Alice and Bob 
share a state pab that has a fc-symmetric extension, ev- 
ery experiment where Bob uses at most k B measurement 
settings (independently of the number of outputs) can be 
simulated by a local model. Note that there is no restric- 
tion on the number of measurement settings Alice per- 
forms. This result can be understood as follows: consider 
a Bell scenario where Alice chooses among m measure- 
ments, represented by measurement operators A x with 
x = l,..,m and Bob among k measurements, given by 
By with y = 1, k. Since pab has a fc-symmetric exten- 
sion, for each measurement of Alice, the joint probability 
distribution p(Ai,Bi, ...,Bk) is well defined via the Born 
rule 10 

p{Ai,B u ...,B k ) = tr[p' ABl _ Bk (A i ®B 1 ® ...®B k )}. 

From these distributions one can then define a joint prob- 
ability distribution for all possible measurements as 



p(A 1 ,...,A m ,B 1 ,...,B k ) = 



UT=iP(A l ,B 1 ,... 7 B k ) 
p(B u ...,B k )™- 1 



10 Note that the same argument holds if, instead of a fc-symmetric 
extension, pab has a fc-symmetric quasi-extension, where, in- 
stead of a state of k + 1 parties p' ABl Sfc , one has an entangle- 
ment witness of fc + 1 parties, W A Bi...B k , with unit trace and 
such that the reduced states satisfy W ab- = PAB f° r all *• 
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\A/£>T--n or- o-f a + « fl t^f ^ h 


Isotropic stcttc (| 5 1 P 


Separable 


P< d^T 


P<3TT 


Local for general measurements 


^ (d-i^-^csd-i) 


^ (d- 1)^-^(3(1-1) 


P — (d+l)d d 


^ — (d+l)d d 


Local for projective measurements 




^ — d-1 



TABLE II Separability and locality boun ds for Werner states (I5 0[l and for isot ropic states (|51|l . For Werner states, bounds for 
projective measurements were d erived in dWernerl. 1 19891) and in dBarrettl . I2002T ) for POVMs. For isotropic states, bounds were 
derived in (| Almeida et aUl2007l h as well as in (|Wiseman et all |2007l f for projective measurements. 



This joint probability distribution provides the local 
model. Notice that if a state has a oo— sy mmetric ex- 
tension it is separable ( Dohertv et all 12004 



2. Hidden nonlocality 



In (jPopescul . fl995h a more general way of extracting 
the nonlocality of an entangled quantum state was pro- 
posed. Instead of performing directly local measurements 
on the state, Alice and Bob use a sequential protocol. 
They first apply a local filtering on their systems and, 
conditioned on a successful operation of the filter, per- 
form the local measurements which will lead to nonlocal 
correlations. Popcscu demonstrated the power of this se- 
quential scenario by showing explicitly that certain states 
admitting a local model can display nonlocality if a ju- 
dicious local filtering is performed. Hence, the filtering 
reveals the "hidden nonlocality" of the state. In partic- 
ular, Popescu showed that this is the case for Werner 
states (see eq. ([50]) ) of local dimension greater or equal 
to 5. 

One can intuitively understand hidden nonlocality in 
the following way. The state p initially shared by Al- 
ice and Bob is mixed. Importantly, even if p is local, it 
may be viewed as a statistical mixture involving nonlo- 
cal states. In order to extract nonlocality from p, Al- 
ice and Bob first apply a local measurement for which 
a given outcome can occur only (or most likely) for a 
nonlocal state in the mixture. Hence, by post-selecting 
those events in which this particular measurement out- 
come occurs, Alice and Bob can filter out the nonlocal 
state. Finally, by performing local measurements, they 
can establish nonlocal correlations. 

In order to exclude the existence of a local model re- 
producing this sequential measurement scenario, it is es- 
sential that Alice and Bob choose the measurement ba- 
sis of the final measurement after a successful opera- 
tion of the filter. If this is not the case, a local strat- 
egy could fake Bell inequality violation by adapting the 
outcome of the first measurement based on the knowl- 
edge of which basis has been chosen for the second mea- 
surement .^^A_Jcrmal acc^nt_o^_tlfis_argj^ent_can be 
found in dTeufel et ai , |l997tlZukowski et alj .ll998). The 
work (|Teufel et all Il997f) also present a general frame- 
work for hidden-nonlocality proofs and a suggested non- 
locality complexity classification scheme based on it. 

Other examples of hidden nonlocality were reported. 
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Local 



k-copy nonlocal 



Super-activation 



Bell violation 



Pl 



Pnl 



FIG. 6 Nonlocal properties of the isotropic state (|5ip . The 
state is sep arable for p <y s = l/ (rf+l), admits a local model 
for p < Pl (| Almeida et all I2007T). and violat es a Bell inequal- 
ity for pl < vnt, < v (jColIins et all I2002a1 ). In the interval 
Pl < p < Pnl, it is not known whether the state admits 
a local model or, on the contrary, violates a Bell inequality. 
Finally, when several copies of the isotropic state can be mea- 
sured jointly, nonlocality is obtained when ever a single copy 
of the state is entangled, that is, if p > p s (jCavalcanti et all 
l2012af ) . In the grey region , supera ctivation of quantum non- 
locality occurs dPalazuelosl . l2012aT ) . 



In ( Gisinl . Il996l ). it is shown that there exist two-qubit 
states which do not violate the CHSH inequality, but do 
viol ate CHSH a fter a judicious local filtering is applied. 
In (|PeresL Il99rj) it is demonstrated that five copies of 
a two-qubit state Werner state (|45|) admitting a local 
model for projective measurements display hidden non- 
locality. It is worth noting that these works on hidden 
nonlocality eventually lead to the concept of distillation 
of entanglement, a central notion in quantum informa- 
tion theory. 

Finally, an important question in this area is whether 
all entangled states feature nonlocality when local filter- 
ing is considered. Although this question is still to be an- 
swered, i mportant progr e ss wa s recently achieved. It was 
show in ( Masanes et all l2008f ) that for every entangled 
state p, there exists another state a which does not vio- 
late the CHSH inequality, s uch that p ® a viol ate CHSH 
after local filtering (see also ( Liang et all 12012] )). In par- 
ticular, if one chooses p such that it does not violate 
CHSH, a phenomenon of "activation" of CHSH nonlo- 
cality occurs. 



3. Multi-copy nonlocality 

Another relevant scenario consists in allowing the par- 
ties to perform measurements on several copies of the 
state p. However, here no initial filtering is allowed, 
contrary to the scenario of hidden nonlocality. In the 
multi-copy scenario, represented in Fig. [SJ;, Alice and 
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Bob can perform measurements on k copies of the state 
p, that is, they measure effectively a state of the form 
ptg>k _ p g p (g ... (g p times). The key point here 
is that the parties can now perform joint measurements 
on their k subsystems, that is measurements featuring 
eigenstates which are entangled. Remarkably, the max- 
imal violation of the CHSH inequality for certain states 
can be inc reased if several copies of the state are jointly 
measured ( Liang and Dohertvl . l2006l) . In fact, there exist 
states p which do not violate the CHS H inequality but 
p® 2 does (jNavascues and Vertesil . l201l[ ). 



In more general terms, the possibility of performing 
measurements on several copies of a state leads to a phe- 
nomenon of activation of nonlocality. Notably, it was 
recently demon strated that quantu m nonlocality can be 
superactivated ( Palazuelosi l2012al) . that is, the combi- 
nation of a number of local quantum states can become 
nonlocal. This demonstrates that nonlocali ty is not an 
additiv e quantity. Specifically, it is shown in ( Palazuelosi 
l2012a| ) that by performing joint measurements on many 
copies of a local isotropic state pi SO (see eq. ([51])) of local 
dimension d = 8, it is possible to obtain Bell inequality 
violation, without involving any pre-processing. This is 
remarkable given that the initial state pi SO admits a local 
model for the most general measurements (i.e. including 
POVMs). 

More recently, it was shown that for every state p £ 
C d (g> C d with singlet fidelity 11 larger than 1/d, there ex- 
i st a number of copies k of p such that p® k is nonlocal 
( Cavalcanti et al l l2012al ). This result implies that every 
entangled isotropic state ([5"Tj) is a nonlocal resource, and 
establishes a direct connection between the usefulness of 
a state in quantum teleportation and its nonlocality (see 
Sec. IIII. A.6|) . Whether superactivation of nonlocality is 
possible for any entangled state admitting a local model 
is an interesting open question. 



4. More general scenarios 

It is also relevant to investigate the case in which sev- 
eral copies of a bipartite entangled state p are distributed 
in a network of N observers, as sketched in Fig. [SJi. It 
turns out that here a phenomenon of activation of non- 
locality can also occur. That is, by judiciously placing 
several copies of a state p admitting a local model, nonlo- 
cal correlations among the N observers can be obtained. 
The state p is then termed a "nonlocal resource" . Again, 
activation of nonlocality is possible here due to the fact 
that one (or more) observer can perform a joint measure- 
ment on several subsystems (see Section UlI. A. 3|) . 



Several examples of activation of nonlocality in net- 
works were reported. First, by concatenating many 
copies of a state which does not violate the CHSH 
inequality in an entanglement swappi ng scenar i o one 
obtains a state wh ich violates CHSH ( De et all . 120051: 
iKlobus et al I l2012t) . Second, it was shown that many 
copies of a two-qubit Werner states (|45[) distributed in 
a star network violate a Bell inequality for p > 0.64, 
hence for states whi ch admit a local model f or projec- 
tive measurements ( Cavalcanti et jjD, l2011at iDe et all 
l2005h . The case of isotropic states, as well as other ex- 
amples of activat i on of n onlocality, were in discussed in 
(|Cavalcanti et l2012bh . 



5. Entanglement distillation and nonlocality 



As already hinted in Section UlI. A. 21 the notion of hid- 
den nonlocality is intimat ely related to entanglement dis- 
tillation. For instance, in (|Peresl . ll996l) . the local filtering 
that is applied on several copies of a state c an be used to 
distill entanglement. Hence the protocol of ( Peresl Il996l ) 
can be decomposed as entanglement distillation followed 
by a standard (single-copy) Bell test. In this sense, every 
entangled state that is distillable can be used to obtain 
nonlocal correlations. 

An interesting question then arises concerning bound 
entangled states, i.e., states from which n o entangle- 
ment can be distilled (Ho rodecki et "all. Il998h . In fact, a 
long standing open conjecture — referred to as the Peres 
conjecture — is that every bipartite state with a posi- 
tive partial transposi tion (PPT), hence undistillable, ad- 
mits a local model (|Perest Il999t) . More generally, the 
goal is to understand the link between distillability and 
nonlocality. Notably, several works established a par- 
tial link between both concepts, showing that important 
classes of Bell inequalities cannot be violated by an y PPT 
state (jSalles et all 120081: IWerner and WolJ . l2000h . For 
instance, the violation of the CHSH (and more gener- 
ally of the Mer min inequalities) certifies that the state 
can b e distilled (|Acml . l200lt: lAcin et all 120031: iMasanesl . 
I2006I ). Finally, note that in the multipartite case, nonlo- 
cality is proven not to i mply distillability of entanglement 
( Vertesi and Brunnerl . |2012| ). 



6. Nonlocality and teleportation 



11 The singlet fidelity (or cquivalcntly entanglement fraction) 
of a state p is defined as the maximal fidelity SF of 
p with a maximally entangled state (MES), i.e.SF(p) = 
max|^ )gMBS {^J\p\ip}. 



Quantum teleportation ( Bennett et all Il993l ) is an- 
other "non-local phenomenon" based on quantum en- 
tanglement. As it is the case with nonlocality, it turns 
out that not every entangled state is useful for telepor- 
tation, in the sense of o utperforming classical strategies 
( Horodecki et all Il999f ) . It is then natural to ask if the 
fact that a state is useful for teleportation is related to 
its nonlocality. 

This question was first raised by ( Popescul Il994l ) , who 
noticed that certain two-qubit entangled Werner state 
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admitting a local model can nevertheless be useful for 
teleportation. This lead to the conclusion that usefulness 
in teleportation and nonlocality arc unrelated. Interest- 
ingly, this difference vanishes when considering more gen- 
eral scenarios for revealing nonlocality. In particular, it 
was recently shown that in the multi-copy scenario, where 
several copies of the state can be jointly measured, any 
state that is usefu l for tel eportation is a nonlocal resource 
( Cavalcanti et "oZl. l2012ah . lence, this work establishes a 
direct link between teleportation and nonlocality. 

Note also, that a more qualitative relation between the 
amount of CHSH vi olation and usefulness for teleporta- 
tion was derived in ( Horodecki et aLl . ll996|) . Specifically, 



the maximal violation S p of the CHSH inequality of a 
two-qubit state p is shown to lower bound its average 
fidelity for teleportation as follows 



F, 



1 



telep 



>o(l + 



o2 

XT' 



(52) 



that the optimal classical strategy achieves 



Notice 

Fteiep = 2/3 in the qubit case. 



7. More nonlocality with less entanglement 

As discussed above, the relation between entanglement 
and nonlocality is subtle. Although the case of pure 
states appears to be straightforward, as every entangled 
state is nonlocal, it is interesting to sec whether a quan- 
titative link can be established between both concepts. 
Astonishingly, it turns out that in certain cases, and de- 
pending of which measure of nonlocality is adopted, less 
entanglement can lead to more nonlocality. 

An example is provided by certain Bell inequalities, 
whose maximal violation can only b e achie ved with par- 
tially entangled states ( Acin et al\ . 120021 ) (considering 
states of a given Hilbcrt space dimension). More impor- 
tantly, there exist simple Bell inequalities, the maximal 
violation of which cannot be obtained from maximally 
entangled states of any dimension, but requires partially 
entan gled states (|Liang et al ]. l201lHVidick and Wehnel 
120111) . Also, it is known that there exist Bell inequal- 
ities for which partially entangled states give violations 
which are arbi trarily larger compared to maxima l ly en - 
tanglcd states (jJunge and Palazuelosl I20III : iRegevl Eillh 
(see Section |HLB~2|) . 

Interestingly it turns out that this phenomenon, some- 
times referred to as an anomaly of nonlocality (see 
(|Methot and Scaranil . 120071 ) for a short review), occurs 
for other measures of nonlocality as well. N otably, this 
effect was discovered in 1993 by Eberhard ( Eberhardl . 
1993), who showed that weakly entangled two-qubit 
states are more resistant to the detection loophole com- 
pared to maximally entangled states (see Sec. IVII.B.l.cl) . 
Moreover, the anomaly of nonlocality was also shown to 
occur when considering the statistical strength of Bell 



tests (jAcin et ali , |2005J), and the simul ation of quan 



120051) . 



B. Nonlocality vs Hilbert space dimension 

In this subsection, we consider the link between non- 
locality and another property of quantum systems: the 
dimension of the Hilbert space in which the quantum 
state and measurements arc defined. Indeed, the Hilbcrt 
space dimension generally represents a resource, in the 
sense that higher-dimensional Hilbcrt spaces contain 
more complex quantum states. 

Formally, we say that the correlations p(ab\xy) have 
a d-dimcnsional representation if there exist a state p 
in C d (g> C d , and measurement operators M a \ x and M b \ y 
acting on C d , such that 



p{ab\xy) = tv(pM a \ x <g> M b \ y ) . 



(53) 



turn correlations with nonlocal resources ( Brunner et ali 



It some cases, it is also admitted that p(ab\xy) has a d- 
dimensional representation if p(ab\xy) can be written as 
a convex combination of correlations of the form (I53p . 

In the following, we discuss two natural questions. 
First, what is the minimal dimension d necessary to re- 
produce a given set correlations p{ab\xy)l This ques- 
tion is closely related to the concept of "dimension wit- 
nesses". Second, how much nonlocality can correlations 
of the form (fBU)) contain as a function of dl 



1. Minimal Hilbert space dimension and dimension witnesses 

Here the general question is to determine what quan- 
tum resources, in terms of Hilbcrt space dimension are 
necessary to reproduce certain quantum correlations. For 
instance, if we consider a Bell scenario with a given num- 
ber of inputs and outputs, what is the minimal dimen- 
sion d such that all quantum correlations (i.e. all cor- 
relations attainable in quantum mechanics) can be re- 
produced? This is in general a very difficult problem. 
In the case of binary inputs and outputs, we know that 
qubits (d = 2) are s ufficient, if conv ex combinations are 
taken into account ( Cirel'sonl . [l9 80). However, beyond 
this simple case, very little is known. In fact, we do 
not even know if a finite d is sufficient for a scenario in- 
volving a finite number of measurements and outcomes. 
Actually, recent work suggests that this might not be 
the case (jPal and Vertesil . l2010l ). giving evidence that the 
maximal violation of the ^3322 Bell inequality (see Sec- 
tion |ILB]3J can only be attained using a quantum state 
of infinite dimension. 

A related question is the following. Given some cor- 
relations originating from measurements on a quantum 
system, can we place a lower bound on the Hilbert space 
dimension of the state and measurements necessary to 
reproduce them? That is, can we show that certain cor- 
relations are impossible to obtain with arbitrary quan- 
tum states and measurements of a given dimension. The 
concept of a dimension witness allows one to address this 
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question. Specifically, a dimensio n witness for quantum 
systems of dimension d is a linear (|B runner et 
function of the probabilities p(ab\xy) described by a vec- 
tor w of real coefficients w a bxy , such that 



Although the exact value of the latter is not known, 



W = ^2 w abx V p(ab\xy) < w d 



(54) 



for all probabilities of the form (|53|) with p in C d £§> C d , 
and such that there exist quantum correlations for which 
W > Wd- When some correlations violate (f5"4"]). they can 
thus only be established by measuring systems of local 
dimension strictly larger than d. The simplest examples 
of dimension witnesses involve Bell inequalities featur- 
ing measurements with ternary outcomes, the maximal 
violation of whi ch cannot be reached with qubits, but 
requires qutrits ( Brunner et all l2008bh . Other examples 
will be discussed in the next subsection. 

It is also possible to de vise entropic dimension wit- 
nesses ( Wehner et al ., 2008). which were discussed in the 
context of information-theoretic tasks. Finally, the di- 
mension of a single system can be witnessed using a crite- 
rion that does not re quire entanglement and nonlocality 
( Gallego et all l2010h : since it does not rely on coher- 
ence, this approach does not single out quantum behav- 
ior: for any d, the threshold corresponding to a quan- 
tum d-dimcnsional system can be exceeded by a classical 
(d + l)-valued variable. 



2. Grothendieck's constant and Bell inequalities with 
unbounded violation 



As mentioned in Section III.F1 there exist several pos- 
sible measures of nonlocality. A natural option consists 
in quantifying the strength a given nonlocal correlations 
q through the following quantity 



Kq) = su Ps 



l<S,q)| 



sup peC |(£, p)|' 



(55) 



This represents the ratio between the maximal quantum 
value for a Bell expression S (i.e. \(S, q)|) and the lo- 
cal bound of Bell inequality (i.e. sup pg£ p)|), maxi- 
mized over all possible Bell expressions S. Note that the 
absolute value is important here, otherwise the quantity 
could be ill-defined. This quantity quantifies how much 
local noise (considering any possible local noise) must be 
add ed to q such t hat the global distribution becom es lo- 
cal (jjunge et all l2010bl ; iPerez-Garcia et all l2008h . An 
interesting feature of this quantity is that it provides a 
unified measure of nonlocality, allowing one to compare 
the viol ations of d iff erent Bell inequalities. 

In (|Tsirelsonl . Il987l ). a connection between 
Grothendieck's inequality, which arose in the study of 
tensor norms, and the quantum violation of certain Bell 
inequalities was pointed out. Tsirclson showed that ^(q) 
is upper bounded by Grothendieck's constant Kq for any 
2-outcome correlation Bell inequality (i.e. XOR games). 



it is proven that 1.6769 < K G < 21og( J +vl) ~ 1.7822. 
Importantly, this bound holds for quantum systems of 
arbitrary dimension. 

Moreover, Tsirclson showed that, when restricting 
to qubits, one has that v(Q) < A'3, where A'3 is 
Grothendieck's constant of order 3. Since it is known that 
A' 3 < Kq, it follows that there exist 2-outcome correla- 
tion Be ll inequalities which ar e dimension witnesses for 
qubits (jBrunner et al. , 2008b). Expl i cit ex amples have 
been constructed in ( Vertesi and Pal |2008| ). Moreover, 
it was proven that dimension witnesses for any Hilbcrt 
space dimension d can be obtained from XOR games 
( Brief et al ]. |20 lit I Vertesi and Pall 120091) . 

Tsirelson also raised the question of whether it would 
be possible to have unbounded violations of Bell inequal- 
ities. That is, does there exist a family of Bell scenarios 
for which the quantity v(Q) is unbounded. This is how- 
ever a very hard problem, mainly because of the difficulty 
of finding Bell inequalities and to estimate their quan- 
tum violations. It was discovered recently that the ab- 
stract concepts of operator space theory provide a useful 
framework for the study of violations of Bell inequalities 
in quantum mechanics ( Junge et al\ , l2010~bl) . This con- 
nection lead to a positive answer to Tsirelson's question, 
with examples of unbounded Bell inequality violations in 
quantum mechanics. 

The first resul t in this direction is due to 
(jPerez-Garcia et all 120081) . Using the framework of 
operator space and tensor norms, these authors could 
prove the existence of tripartite correlation Bell in- 
equalities with unbounded quantum violations. Later 
the same authors showed that similar res ults hold for 
(non-c orrelation) bipartite Bell inequalities ( Junge et all 
l2010bl ). More formally, these studies focus on the quan- 
tity v(Q) (see Eq. (|55[1 ). i.e. the maximal quantum viola- 
tion of any Bell inequality S, as a function of the number 
of measurement settings, outcomes, and Hilbert space di- 
mension. Remarkably they show that v(Q) can be upper 
and lower bounded by ratios of different norms of the Bell 
expression S (here viewed as a functional), which have 
been studied in operator space theory. 

While the works mentioned above give non- 
constructive proofs of the existence of Bell inequal- 
ities with unbounded violations, explicit examples 
have also been found. The strongest result is due to 
(lJunge and Palazuelosl l201ll ) who constructed explicitly 
(up to random choices of signs) a bipartite Bell inequal- 
ity featuring a violation of order of j^^, where each 
party has k possible measurements with k outcomes, 
considering quantum systems of dimension d = k. No- 
tably, this constructions appears to be close to optimal, 
as a separation between this viol ation and known upper 
bounds is only quadratic in k ( Junge and Palazuelosl 
120111 : Uunge et all l2010b[ ). Recently, an explicit and 
simplified presen t ation of these Bell inequalities was 
given m ()Regevl I2012D . based on standard quantum 
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information techniques. Other explicit examples of 
Bell inequ alities with unbounded violations have been 
presented ( Buhrman et all 1201 ll ). Finally, an upper 
bound to the maximum possible Bell violation (for any 
possible B ell inequality) of a given quantum state was 
derived in (jPalazuelosll2012bh . 



C. Simulation of quantum correlations 

So far we have examined which quantum resources 
are necessary to produce nonlocal correlations, in term 
of entanglement or Hilbert space dimension of quantum 
states. Here, we discuss the converse question. How can 
we use nonlocal resources to characterize and quantify 
the nonlocality of entangled quantum states? If a state 
violates a Bell inequality, we know that its measurement 
statistics cannot be reproduced by a local model. How- 
ever, we can simulate its correlations if we have access 
to a nonlocal resource, such as classical communication 
or nonlocal resources such as the PR-box. The minimal 
amount of nonlocal resources required can then be con- 
sidered as a measure of the nonlocality of the state. Here 
we give a brief review of progress in this direction. 



1. Simulating the singlet state 

A classical simulation protocol of a given quantum 
state aims at reproducing the correlations obtained 
from local measurements on \ip) 1 using only shared ran- 
domness and classical communication. For definitcness, 
we shall focus here on the singlet state of two qubits, i.e. 
|V') = ( 1 01) — |10))/-\/2, which is also the most studied 
case. 

Alice and Bob first receive as input a unit vector on the 
Bloch sphere, i.e. fiA,nB € K 3 , representing projective 
measurements fiA ■ o and fis ■ 3, where a is the vector of 
Pauli matrices. Then, they are allowed to exchange clas- 
sical communication. Finally, they must produce binary 
outcomes a, b = ±1 reproducing the expected statistics, 
i.e. p(ab\fiAfiB) = 1/4(1 — abfiA ■ ?Vb). 

It is then interesting to look for the model using the 
least classical communication, since the smallest number 
of bits required to simulate can be considered as a 
measure of the nonlocality of\ip). This a pproach was pro- 



posed independently by severa l auth ors (IBrassard et 
119991: iMaudlinl . Il992t Isteinerl . 120001) . These first par - 



tial results were superseded in ( Toner and Baconl |2003^> . 
where it is shown that a single bit of communication is 
sufficient to simulate exactly the correlations of local pro- 
jective measurements on a singlet state. Note that in 
this model Alice and Bob use infinite shared random- 
ness, which is proven to be necessary fo r models with 
finite communication ( Massar et alluOOlh . 

It is also interesting to investigate simulation models 
using only non-signaling resources, such as the PR box 
(see Section III. C. 21) . Remarkably, a single PR box is 



enou gh to simulate the singlet correlations ( Cerf et all . 

The latter model is even more economical than the 
model of Toner and Bacon, since a PR box is a strictly 
weaker nonlocal resource; indeed, while it is possible to 
get a PR box from one bit of communication, the oppo- 
site is impossible since the PR box is non-signaling. 

Finally, it is also possible to devise a simulation model 
of the singlet state in which post-selection is allowed 
(jGisin and Gisinl . I1999T ) . that is, the parties are not re- 
quired to provide an output in all runs of the protocol. 
Indeed post-selection should be considered as a nonlocal 
resource, giving rise to the detection loophole (see Sec. 
IV11.B.1D . 

An elegant uni fied presentation of a ll the above models 
can be found in ( Degorre et all 120051) . 



2. Other quantum states 

The simulation of quantum correlations of arbitrary 
bipartite entangle d quantum states has als o been inves- 
tigated. Notably, ( Regev and Tonerl . 120071 ) showed that 
the correlations obtained from local measurements with 
binary outputs on any p G C d <S) C d can be simulated 
with only two bits of co mmunication, which is pr oven to 
be necessary in general ( Vertesi and Bend . 120091) . Note 
however that this model focuses on the correlations be- 
tween the outcomes of Alice and Bob, and does not, in 
general, reproduce the expected quantum marginals. 

A case of particular interest is that of partially entan- 
gled qubit states, i.e. \ipg) = cos 9 1 00) +sin# |11). While 
it is shown that its correlations (including marginals) can 
be per fectly simulated with two bits of communication for 
any 9 ( Toner and Baconl 120031 ) , it is not known whether a 
single bit of communication would suffice. It is however 
proven that a simulation model us ing a single PR box 
does not exist ( Brunner et aZ.I . [2005h for weakly entangled 
states (9 < n/7.8). Thus it appears that less entangled 
states require more nonlocal resources to be simulated 
compared to maximally entangled ones, illustrating the 
subtle relation between entanglement and nonlocality. A 
simulation model for states \ipg) u sing only non-signalin g 
resources has also been presented ( Brunner et all l2008al) . 

Moreover, ( Brassard et all Il999h established that the 
simulation of measurements with d outcomes on a maxi- 
mally entangled state in C d <g> C d requires classical com- 
munication of order d bits. Therefore, there exist families 
of quantum nonlocal correlations requiring an arbitrarily 
large amount of classical communication for being simu- 
lated. 

More generally the problem of simulating quantum 
nonlocal correlations is intimately related to the field of 
communication complexity. Thus, many results on com- 
munication complexity are relevant in the context of non- 
locality. For more details on communication complexity 
and on the procedure for converting communication com- 
plexity problems in to nonlocal tasks, we r efer the reader 
to a recent review ( Buhrman et "aLl. l2010D . 
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3. Elitzur-Popescu-Rohrlich decomposition 

A different perspective on simulating quan tum corre- 



n g quan t 

lations was presented in (lElitzur et all . Jl992h — often re- 



ferred to as the EPR2 approach. These authors pro- 
posed to decompose a quantum probability distribution 
p q (ab\xy) into a local and nonlocal part. Formally, that 
means to write p q as a convex combination of a local 
distribution (pi) and a nonlocal one (p n s)'- 

p q (ab\xy) = wpi(ab\xy) + (1 - w)p ns (ab\xy). (56) 

Note that, since p q and pi are no-signaling distributions, 
p ns is also no-signaling (hence the subscript ns). Clearly, 
any distribution can be written in this way (take for in- 
stance w = and p q = p ns ). To find the EPR2 decom- 
position, one then finds the maximum of w among all 
possible decompositions of the form (|56[) . This quantity, 
denoted w max , defines the local content of the distribu- 
tion p q . The EPR2 decomposition can be understood as 
a simulation of the distribution p q where, with probabil- 
ity w max a local distribution is used, and with probability 
1 — w m ax a nonlocal (no-signaling) distribution is used. 
Note that q ma x can also be considered as a measure of 
the nonlocality of the distribution p q : if w ma x = 1, p 9 is 
local; if w max < 1, Pq is nonlocal; if w max = 0, p q is fully 
nonlocal. 

One can bound w max , for a given distribution p q , 
trough the violation of a Bell inequality b-p < 6/. Denote 
Q, the Bell value of distribution p q . It is straightforward 
to see that 



< 



(57) 



where b ns is the maximal value of the Bell expression 
b for any no-signaling distribution. Notice that, if pg 
reaches the maximal value allowed by no-signaling (i.e. 



Q = b ns ), then w„ 



0. This means that the quantum 



distribution is maximally nonlocal according to the EPR2 
decomposition, hence no local part can be extracted. 

It is also possible to define the local content of a quan- 
tum state. To do this, consider all possible measurements 
that can be applied to a quantum state and then derives 
the local content for the dist ribution obtained fr om these 
measurements. Originally, ( Elitzur et all Il992l ) showed 
that the maximally entangled state of two qubits has zero 
local content, i.e. it is fully nonlocal. This result was 
then generalized to any b ipartite maximally entangled 
state ( Barrett et all 120071 ). via a generalization of the 
chained Bell inequality, showing that such states can pro- 
vide maximally nonlocal and monogamous correlations, 
which is relevant for instance in quantum cryptography. 

The local content of other quantum states has also 
been discussed. In particular, for the case of two-bit en- 
tangled pure states \ipg) = cos#|00) +sin#|ll), with 
8 6 [0,7r/4l, it was p roven that q max = 1 — cos(20) 
dPortmann et all l2012h . The EPR2 decomposition of 
pure entangled two-qutrit states was also sketched in 
(lScaraniLl2008h . 



Finally, note that these ideas were generalized to the 
multipartite case (see Section |VI)) . 



IV. APPLICATIONS OF QUANTUM NONLOCALITY 

When considering nonlocality as a potential resource 
for information processing, two intuitive ideas immedi- 
ately come to mind. First, since the existence of non- 
local correlations between the two wings of a Bell ex- 
periment seems to imply some connection between these 
two distant wings, one could hope to exploit this connec- 
tion to communicate, and in particular to communicate 
faster-than-light. Second, since a local model for a Bell 
experiment is equivalent, as we saw in Section IlI.Bl to a 
deterministic model in which a definite outcome a(x) and 
b(y) is assigned in advance to every measurement x and 
to every measurement y, nonlocality then suggest, in con- 
trast, that these measurement outcomes are fundamen- 
tally undetermined and thus that they could be used to 
establish cryptographic keys. Both ideas are partly true 
and partly misleading. In both cases, the no-signaling 
principle plays a fundamental role. 



A. Communication complexity 

In the first example discussed above, no-signaling acts 
as a limitation: we have already seen that the no- 
signaling conditions (|7|), which are satisfied by any set 
of correlations arising from measurement on quantum 
systems, imply that Bob's outcome does not reveal any 
information about Alice's input x and the other way 
around. Thus, no-signaling prevents the use of non-local 
correlations as a substitute for direct communication be- 
tween Alice and Bob. It may then come as a surprise 
that non-locality can nevertheless be exploited to reduce 
the amount of communication in certain distributed com- 
puting tasks, both in information-theory and the study of 
communication complexity. In the setting of communica- 
tion complexity, Alice receives an n-bit string x and Bob 
receives an n-bit string y and the aim is for Bob to com- 
pute some function f(x,y) with as little communication 
between Alice and Bob. This can always be achieved if 
Alice sends her n-bit string x to Bob, but for certain 
functions less communication is sufficient. The mini- 
mum number of bits that must be exchanged between 
Alice and Bob for Bob to determine f(x, y) is known 
as the communication comple xity of /. It was realized 
in ( Cleve and Buhrmanl . Il997l) that if Alice and Bob can 
share systems exhibiting non-local correlations, then they 
can compute certain functions with less communication 
than would be required without such non-local systems. 
This phenomenon does not violate the no-signaling prin- 
ciple because the knowledge that Bob obtains about Al- 
ice's input through f(x, y) is no greater than what is 
already conveyed by Alice's communication. The field of 
communication complexity is an active field of research in 
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computer science, in which strong connections with non- 
locali ty have been discovered since ( Cleve and Buhrmanl . 
Il997h . For more details, we refer the reader to a recent 
review on the subject (jBuhrman et all feoiOl ). 



duction of cryptographic keys that are genuinely random 
and secure to any adversary limited by quantum theory 
or, more generally, by the no-signaling principle. 



B. Information theory 

Nonlocal correlations can enhance communication 
power in the context of information theory. Consider 
two parties, Alice and Bob, communicating via a noisy 
communication channel. If we only care about the 
rate at which information is transmitted from Alice 
to Bob such that the error rate goes to zero in the 
large block length limit, then this tra nsmission rate 



cann ot be increased using entanglement (Bennett et al 



2002) or even no-signaling correlations (jCubitt et al 
20111 ). However, the situation changes when we care 



about the rate at whi ch information can be sent with- 
out any error at all ( Cubitt et all 1 2 1 lh . The maxi- 
mum such rate is known as the zero-error capacity of a 
noisy-communication channel. For example, it is known 
that if Alice and Bob share certain no-signaling correla- 
tions, zero-error transmission becomes possible through 
a noisy-channel, even if that channel's zero-error ca- 
pacity is_zero_w^out_the ability to use such correla- 
tions (jCubitt et all 120101) . Notably even certain quan- 
tum correlations are useful in this context. 



C. Quantum cryptography 

In our second intuition discussed above, that the vio- 
lation of Bell inequalities guarantee the presence of ran- 
domness, the no-signaling principle is no longer a limi- 
tation, but a prerequisite. Indeed, in the same way that 
every local model can be seen to be equivalent to a local 
deterministic model, every nonlocal model is equivalent 
to a nonlocal deterministic model where, for each mea- 
surement runs, definite outputs a(x, y) and b(x, y) are as- 
signed to every pair of inputs (x, y). In full generality, the 
violation of Bell inequalities docs not therefore guarantee 
by itself any indeterminacy in the outcomes a and b (as 
we have already stressed in the introduction, non-locality 
is - as its name indicate - about the violation of locality, 
not about the violation of determinism). However, every 
nonlocal deterministic model is necessarily signaling: if 
a(x, y) depends non trivially on both x and y, then Alice 
can recover some information about Bob's input y from 
the knowledge of the output a and her choice x. In a 
model that reproduces nonlocal correlations and which 
is intrinsically no-signaling, the measurement outcomes 
cannot therefore be fully determined in each measure- 
ment run and they must exhibit some randomness. This 
intuition is at the basis of device-independent cryptogra- 
phy in which the violation of a Bell inequality, which can 
be asserted without any detailed physical assumptions on 
the working of the quantum devices, guarantees the pro- 



1. Initial developments 

One of the earliest conne ctions between nonlocality 
and cryptography is due to (Herbert], Il975l ). who inter- 
preted the '0' and '1' outcomes produced by two distant 
quantum devices as correlated binary random messages. 
By considering the error rates in such messages, he pre- 
sented an elementary derivation of Bell's theorem, but he 
did not go as far as deducing that quantum nonlocality 
could be exploited for a secure cryptographic scheme. 

The practical application of Bell non-locality to cryp- 
togr aphy was firs t realized by Ekert in his celebrated pa- 
per ( Ekertj . Il99ll ). which represents more generally one 
of the founding articles of quantum cryptography. The 
problem of establishing a secure, encrypted communica- 
tion between two parties can be reduced to the problem of 
generating a secure, cryptographic key, i.e., a sufficiently 
long strings of random bits that are shared between Al- 
ice and Bob, but unknown to any potential eavesdropper 
Eve. Ekert presented a protocol for this key distribu- 
tion problem which is based on the CHSH inequality and 
uses a source of two-qubit maximally entangled states 
| <p + ) — here we present a sl i ght va riation of this protocol 
introduced in ( Acm et all . 120063) . Each party repeat- 



edly receives one qubit from the source and performs a 
measurement on it. In each run, Alice chooses among 
three possible measurements x = 0, 1,2 and obtains an 
outcome bit a; Bob chooses among two possible mea- 
surements y = 0, 1 and obtains an outcome bit b. Once 
all states have been measured, Alice and Bob announce 
publicly the settings they have chosen for each particu- 
lar measurement and divide their results in two groups. 
The subset of the results corresponding to the measure- 
ments x = 1, 2 and y = 0, 1 is used to evaluate the CHSH 
inequality violation. Hence these measurements are cho- 
sen such as to maximize this violation (see Section [I. Al) : 
for instance, Alice measures in the direction 0, 7r/2 in the 
x— z plane of the Bloch sphere for x = 1,2 and Bob in the 
directions — 7r/4,7r/4 for y = 0, 1 (also in the x-z plane). 
The subset of results corresponding to the choices x = 
and y = are used to generate the shared key. Hence 
the measurement x = is chosen in the same direction 
— 7r/4 as Bob's input y = 0, in such a way that the key 
bits a and b are perfectly correlated. 

The CHSH violation guarantees, as discussed earlier, 
that the key bits are undetermined and in particular that 
Eve could not have fixed them in advance. More gener- 
ally, Eve could attempt to obtain information about the 
values of a and b by performing delayed measurements 
(after the public disclosure of Alice and Bob's settings) 
on a system of her own correlated with Alice and Bob's 
systems. As remarked by Ekert, the protocol is also se- 
cure against such attacks as a maximal CHSH violation 
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guarantees that the state shared by Alice and Bob is 
(essentially equivalent) to a pure entangled \<f>+) state, 
which cannot be correlated to any system in Eve's pos- 
session. In a realistic implementation, Alice and Bob's 
key bits will not be perfectly correlated and the CHSH 
violation will not be maximal, implying that Eve can ob- 
tain some finite information on these key bits. But pro- 
vided that these imperfections are not too important, it 
should be possible to distill a shared secret key from the 
raw data of Alice and Bob by applying error-correction 
and privacy amplification protocols. 

The intuition for security in the Ekert protocol is based 
on the violation of a Bell inequality which can be assessed 
independently of the protocol's implementation, but this 
aspect was not fully recognized at the time. When assum- 
ing that Alice and Bob's devices perform measurements 
on qubits in complementary bases, Ekert's protocol was 
found to be equivalent to an entanglemen t-based version 
of th e Bennett-Brassard 1984 protocol ( Bennett et all 
11992ft . This was important in establishing entanglement 
as a central concept for QKD, but it also implied that 
the subsequent security proofs used "qubits" and "com- 
plementary bases" as implicit assumptions 12 . One crucial 
point was (understandably) missed in those early days: 
that the implicit "qubits" and "complementary bases" 
assumptions require a very good control of, and ulti- 
mately some trust about, the physical implementation 13 . 

However, as the reader knows by now, the violation of 
Bell inequalities can be established without such knowl- 
edge. Therefore, a cryptography protocol based on non- 
locality requires fewer assumptions: the devices can in 
principle be tested and the security of the protocol cer- 
tified without any detailed characterization of the de- 
vices; to some extent, the devices can even be malicious 
and been prepared by the eavesdropper. This is called a 
device-independent (DI) assessment. 

The idea of DI quantum cryptography was first made 
explicit by Mayers and Yao. who called it self-testing 
(jMavers and Yaol . 1 19981 120041 ) . Though their analysis is 
not directly based on Bell inequalities, it obviously ex- 
ploits correlations that are nonlocal. The breakthrough 
that pushed the recent development of DIQKD came with 
Barrett, Hardy, and Kent who introduced a QKD proto- 
col based on the chained Bell inequality and proved it to 
be secure against "super-quantum" eavesdroppers that 
may violate the law of quantum physics but which are 
constrained by the no-signaling principle ( Barrett et all 



12 Quantitative relations between security bou nds and the viola- 
tion of Bell inequalities were pointed out llScarani and Gisinl . 
I2001T) ; but this link turned out to be an artifact of the assump- 
tion called "individual attacks" and did not survive in stricter 
security proofs, which were rather derived from the notion of 
entanglement distillation. 

13 It is interesting to notic e, though, the note added in print to 
the l|Bennett et al paper, which come close to recognize 
explicitly the device-independent aspect of Ekcrt scheme. 



l2005aft . A practical protocol based on the CHSH inequal- 
ity was then introduced in ( Acm et all . l2006al ) (though 
it was proved secure only against a restricted family of 
attacks), where it was also noticed that proving security 
assuming only the no-signaling principle implies in par- 
ticular that one can do away with the "device-dependent" 
assumptions of standard QKD. The DI potential of such 
QKD scheme based on Bell in equalities was then fully 
perceived in ( Acm et all . 12007ft . which introduced a DI 
security proof for collective attacks of the variation of 
Ekert's protocol presented above against an eavesdrop- 
per constrained by the entire quantum formalism and 
not only the no-signaling principle. 

Finally, the ideas from DIQKD have been adapted to 
the simpler task of DI randomness gene ration (DIRNG) 
in (|Colbeckl 120071: iPironio et ail. l2010aft and to d istrust- 
ful quantum cryptography in ( Silman et all 1201 lh , where 
a scheme for the device-independent implementation of 
(imperfect) bit commitment and coin tossing was intro- 
duced. 

In the following we discuss in more details the sta- 
tus of current security proofs for DIQKD and DIRNG, 
the assumptions on the devices that underlie them, and 
the prospects for experimental implementations. We first 
briefly discuss the quantitative aspects of the relation 
between randomness and non-locality since it is at the 
basis of many security proofs and protocols. Note that 
the development of DIQKD and the recent attacks on 
standard QKD protocols such as ( Lvdersen et all 12010ft 
have led to a series of feasible proposals for QKD that 
are intermediate between device-depend ent and d evice- 
independent schemes , see for instance rtLo et 
iLvdersen etaH l2010l : iTomamichel et all 120121 We do 
not review this work here, as it does not directly relies 
on nonlocality as a resource. 



2. Randomness vs non-locality 

a. Quantitative measures of randomness. Imagine Alice 
holds of a measurement device that produces outcomes a 
when performing a measurement, where we let Ra denote 
the random variable of the outcome. When can we say 
that a is random? One way to think about randomness 
is by means of introducing an observer, Eve, who tries 
to guess Alice's measurement outcome a - the better the 
guess the less random a is. In order to guess a, Eve may 
perform an arbitrary measurement on a system E, which 
is possibly correlated with the one of Alice. We will use 
z to label her measurement setting and e to label her 
measurement outcome. For any given z, Eve best guess 
for a corresponds to the most probable outcome, the one 
maximizing p(a\ez). The guessing probability of Eve is 
then defined as her average probability to correctly guess 
a, maximized over all her possible measurements 

P S uess(RA\E) := maxS^ p(e\z) maxp(a|e, z) . (58) 

z * — * a 
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This guessing probability can also be ex- 
pressed as the min- entropy H m \„ (Ra \E) = 
-\og 2 p gucss (R A \E) (jKonig et all 120091 ). It takes 
on values between and log | J2>i | , corresponding to the 
cases where Eve can guess perfectly, and where Eve's 
probability of guessing is no better than for the uniform 
output distribution 1/l-R^I respectively. 

The min-entropy is a good measure of how random Al- 
ice's measurements outputs are because it tells us exactly 
how many uniform classical random bits I can in princi- 
ple be obtained from a classical string a € Ra by apply- 
ing some function f r : Ra — > {0, l} e . It is easy to see 
that if we only have a guarantee about the min-entropy 
of the so-called source Ra, then no randomness can be 
obtained using just one deterministic function /. How- 
ever, if we are willing to invest some perfect randomness 
labeled R = r from an initial seed, and choose a function 
f r depending on it, then we can obtain randomness. This 
process is known as randomness extracti on and e n joys a 
long history in computer science (see ( Vadhanl . [20121 ) 
for a survey). Formally, a (strong) extractor produces an 
output Pf{r a )e that is close to uniform and uncorrelated 

from Eve Pf(r a )er ~ ® PerWi < 6 f° r some small 
e, even if Eve later learns which function f r we applied. 
In the context of cryptography, this is also called privacy 
amplification. If Eve only holds classical side- information 
about Alice's system it is known that randomness extrac- 
tion is possible, where the maximum outp ut size obeys 
I « H min (R A \E) ([impagliazzo et all [19891 . T his is also 
true i f Eve h o lds quantum side- infor mation ( De et all . 
120091 : iRennerl . 120081 : iTa-Shmal . [20091. More gene rally 



the full quantum min-entropy ( Konig et all I2009T ) has 
been shown to characterize exactly how much random- 
ness can be obtain ed by making measurements on A in 
( Berta et al I l2012h . However, no such general result is 
known if Eve holds arbitra ry no-signaling (i.e.. supra- 
quantum) side-information ( Hanggi and Rennerl . l201dh . 
see section IIV.C.4I for a more detailed discussion. 



b. Randomness and Bell violations In order to discuss 
quantitative links between randomness and the viola- 
tions of Bell inequalities, it is useful, as in the above 
discussion, to introduce an additional observer and thus 
consider nonlocal correlations shared between Alice, Bob 
and Eve. In such a tripartite setting, the correlations 
are characterized by the probabilities p(abe\xyz). If Eve 
measures z and obtains e, then Eve's characterization 
of Alice's device is now given by the conditional proba- 
bility distributions p(a\xez). If Eve learns x, then for 
any given z her best guess for a corresponds to the 
most probable outcome maximizing p(a\xez). Maximiz- 
ing over z thus means that Eve can guess a with prob- 
ability p gucss (RA\EX = x). In the case where a can 
take on two values and Alice and Bob's devices are char- 
act erized by a CHSH e xpectation value S it is shown 
in ( Pironio et "aLl. l2010all (see also ([Masanes et aUl201lh 
for an alternative derivation), that independently of the 



devices' behaviours and Eve's strategy 

Pgucss 



(R A \EX = x) < i (l + V2 - 5 2 /4) . (59) 



In particular, when S = 2\/2, we get as expected 
P guess (A\EX = x) < 1/2 corresponding to 1 bit of 
min-entropy H nlln (RA\EX = x), implying that Alice's 
output is fully random. While when the CHSH ex- 
pectation achieves the local bound S = 2, we get 
the trivial boun d p euess (A\EX = x) < 1. Using the 
SDP hierarch y ([Navascues et all 120071 . l2008h (see Sec- 
tion Ill.C.l.dp it is possible to derive analogous bounds 
for arbitrar y Bell inequal i ties, se e Supplementary infor- 
mation of ( Pironio et all l2010al ) for details. One can 
also compute upper-bounds on the guessing probability 
not only for the local randomness (corresponding to the 
output a alone), but also of the global randomness (cor- 
responding to the pair of outcomes a and b). At the 
point of maximal CHSH violation, for instance, one finds 
p gucss (R A RB\EX = xY = y) < 1/4+ v^Z^^ 0.427 



corresponding to 1 . 23 bits of min-entropy ([Acfii et all 
120121: IPironio et all [2010a[l , where the random variable 
R B corresponds to Bob's outcome. 

The above bounds on the guessing probability are 
obtained by assuming that the devices and the eaves- 
dropper obey quantum theory. Similar bounds can be 
obtained assuming only the no-signaling principle. In 
this case, one obta ins the following tight bound for the 
CHSH inequality (TBarrett et all . l2007t iMasanes et all 
120091; IPironio et all l2010all 



s (R A \EX=x)< 



S 
4 



(60) 



At the point 5* = 2\/2 of maximal quantum violation, one 
finds p guess (A\EX — x) < 0.79 which is, as expected, less 
constraining than the quantum bound (|59p . Maximal 
randomness, p guess (A\EX = x) < 1/2 is obtained now 
only at the maximal no-signaling violation S = 4 of the 
CHSH inequality, corresponding to a PR-box. The above 
bound has also been generalized for the d-o utput ro-input 
chain ed inequality (see Section Hi. B.3| in ( Barrett et all 
120071) : 



P g u CSS (RA\EX = x) = l + ^sitTL 



(61) 



For m — > 00 the maximal quantum violation of the 
chained inequality tends to the maximal no-signaling vio- 
lation ^chained = — note that the Bell inequality is writ- 
ten as 

S (d,m) > d - 1, thus a "high" violation means 
a lower value for S^'"^. In this limit, one thus gets 
Pguess (Ra I E X = x) < 1/g?, i.e. the outcome can be certi- 
fied to be fully random even assuming no-signaling alone. 
This property is ce ntral to the se c urity o f the QKD proto- 
col introduced in dBarrett et all l2005al) a nd was further 
developed in (IColbeck and Rennerl . l200i 1201 lb to show 
that some extensions of quantum theory cannot have im- 
proved predictive power. 
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Naively one would expect that less nonlocality in a 
Bell-type experiment implies less randomness. In the 
quantum setting, this intuition is not always correct. 
In the case of two-output Bell scenarios, the maximal 
amount of local randomness (characterizing the single 
outcome a) corresponds to 1 bit of min-entropy and 
the maximal amount of global randomness (character- 
izing the joint outcome pai r a, b) corresponds to 2 bits. 
It is shown in ( Arin et al\ , |2012| ). through a family of 
(non-facet) two-input two-output Bell inequalities, that 
such values can be attained with nonlocal correlations 
that are arbitrarily close to the local region or which 
arises from states with arbitrarily little entanglement. 
This work suggests that while non-locality is necessary 
to certify the presence of randomness, its quantitative 
aspects are related to the extremality of non-local cor- 
relations. Extremality was already identified as a key 
property for characterizing the behaviours which are in- 
de pendent of any mea surement results of an eavesdropper 
in ( Franz et all 1201 lh . This work also presents different 
tools to certify and to find extre mal behaviours for p ar- 
ticular Bell scenarios. Finally, in ( Dhara et all [20121) . it 
is shown how that maximal global randomness can be 
obtained in a variety of scenarios (including multipartite 
ones) from the violation of certain Bell inequalities. 



3. Device-independent randomness generation 

The above relations between non-locality and random- 
ness immediately suggest to use Bell-violating devices to 
certi fy the generatio n of random numbers in a DI man- 
ner (IColbeckJ. 120071) . This idea was further developed 
m (|Pironio et a/.l . l2010al) . where a practical protocol for 
randomness generation was introduced, the first quanti- 
tative bounds on the randomness produced where shown, 
and a proof-of-principle experimental demonstration was 
performed. 

The bounds that we have presented above only relate 
the randomness and expected Bell violation of a pair 
of quantum devices for a single use of the devices. In 
an actual protocol for DIRNG, however, the devices are 
used n times in succ ession. A typical protocol consists 
of three main steps (IColbeckl . 120071 IColbeck and Kent! 
|201 it iPironio et all l2010al) . A measurement step, where 
the successive pairs of inputs {x\, yi), . . . , (x n , y n ) are 
used in the devices, yielding a sequence of outputs 
(di, &i), . . . , (a n , b n ). An estimation step, where the raw 
data is used to estimate a Bell parameter (if this param- 
eter is too low, the protocol may abort). A randomness 
extraction step, where the raw output string is processed 
to obtain a smaller final string r — n , ■ ■ ■ , r m which is 
uniformly random and private with respect to any poten- 
tial adversary. In addition to the Bell violating devices, 
the protocol may also consume some initial random seed 
for choosing the inputs in the measurement step and for 
processing the raw data in the randomness extraction 
step. If more randomness is generated than is initially 



consumed, one has then achieved DI randomness expan- 
sion. 



In ( Pironio et Ml, l2010al) . a generic family of proto- 
cols based on arbitrary Bell inequalities and achieving 
quadratic expansion are introduced. These protocols are 
robust to noise and generate randomness for any amount 
of violation (up to statistical errors). The analysis of the 
randomness that is produced is based on an extension 
of the single-copy bounds of the form ([55)) and to 
the n-copy case. A proof-of-principle implementation us- 
ing two entangled atoms separated by about 1 meter was 
also reported (see Section IVH]) . The security of these pro- 
tocols has been proven against quantum or no-signaling 
adversaries with classical- side information. The techni- 
cal tools for proving sec urity were already introduced 



in ( Pironio et all 2010a), but this was rigorously es- 



tablis hed only in ( Fehr et all . 120131 : IPironio and Massail 
120131 ). In these later works, it was further shown how 
to achieve supcrpolynomial randomness expansion by re- 
peatedly using the randomness of a pair of devices as 
input for another pair. A scheme based on the CHSH 
inequality secure against adversaries with quantum-side 
information and achieving superpolynomial expansion 
with a single pair of quantu m devices was obtained in 
( Vazirani and Vidickl . l2012al ). This scheme, though, re- 
quires a high violation of the CHSH inequality and is not 
noise-tolerant. 

The security of the above protocols relies on a series of 
minimal assumptions. First, the devices and the eaves- 
dropper are constrained by quantum theory, or at least 
by the no-signaling principle. Second, the initial ran- 
domness seed is independent and uncorrelated from the 
devices' behavior. Third, the two quantum devices are 
non- interacting during each successive measurements 14 . 
Fourth, it is also implicit of course that the devices can 
be secured, in the sense that they do not leak directly un- 
wanted information to the adversary. Apart from these 
basic requirements, the devices are mostly uncharacter- 
ized. In particular, no assumptions are made on the spe- 
cific measurements that they implement, on the quantum 
state that is being measured or on the Hilbert space di- 
mension, etc. 

The level of confidence in the realization of the above 
assumptions in an actual implementation or the measures 



14 Note that this does not necessary imply that the measurements 
should be space-like separated in the relativistic sense. This 
space-like separation is required to close the locality loophole in 
fundamental tests of Bell inequalities, where the aim is to rule 
out alternative models of Nature that can go beyond present-day 
physics. In the context of DIRNG, we assume however from the 
beginning the validity of quantum theory and use Bell inequali- 
ties as a tool to quantify in a DI way the randomness of quantum 
theory. Once we assume quantum theory, they are many ways to 
ensure that the two systems are not interacting other than plac- 
ing them in space-like intervals, e.g. by shielding the dev ices. 
See Supplementary Information C of l|Pironio et all l2010af ) for 
a more extensive discussion 
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that must be taken to enforce them may vary depending 
on the adversary model that one is considering. For in- 
stance, it may depend on whether the devices are con- 
sidered to be outright malicious and programmed by a 
dishonest provider (i.e. the adversary itself) or whether 
the manufacturer of the device is assumed to be honest 
and the concept of DI is merely used to account for lim- 
ited cont rol of the apparatus or unin tentional flaws in the 
devices ( Pironio and Massarl I2013D . In the later case, 
in particular, a weak source of randomness, such as a 
pseudo-random generator, may be sufficient for all practi- 
cal purposes to generate the initial seed (in which case the 
protocol, which produces strong cryptographically secure 
randomness, is best viewed as a randomness generation 
protocol than an expansion one). Note in addition that 
in the honest-provider scenario, the adversary may be 
considered to be disentangled from the quantum devices, 
implying that pr oving secur i ty against classical-side in- 
formation as in (|Fehr et all |20 13t iPironio and Massarl 
120131 ) is already sufficient. 

Recently, protocols and security analysis have also 
been introd uced where some of t he above assumptions are 
relaxed. In (ISilman et all l2012f) , the separation assump- 
tion is relaxed and a small amount of cross-talk between 
the devices is allowed. This opens up the possibility of us- 
ing existent experimental systems with high data rates, 
such as Josephson phase q ubits on the same chip. In 



sucn as jo scpnson pnasc qubits on tnc same cmp. 111 
( Colbeck and Rennerl . l2~012| ). the problem of randomness 
amplification is introduced, which aims at extracting per- 
fect (or arbitrarily close to perfect) randomness from an 
initial source that is partly correlated with the devices 
and the adversary. It is shown that if one is given access 
to certain so-called Santha-Vazirani (SV) sources, then 
randomness amplification against an adversary limited 
only by the no-signaling principle is possible for certain 
parameters of t he source. Improving on this first result, 
it is shown in dGallego et all . l2012ah that an arbitrar- 
ily SV source can be amplified using certain multipartite 
quantum correlations. Finally, less stringent models of 
a co mpromised r andom seed than SV have been consid- 
ered (jHalll . 120111 ) and the conditions for Bell-based ran- 
domness e xpansion against an i.i.d. adversary has been 
studied in (|Koh et aU l2012f) . 



4. Device-independent quantum key distribution 

The protocols, the underlying assumptions, and the se- 
curity proofs for DIQKD are similar in spirit to DIRNG 
with the added complication that DIQKD involves two 
remote parties that must communicate over a public 
channel to establish the shared secret key. A typical 
DIQKD protocol consists of the following steps. A mea- 
surement step, where Alice and Bob measure a series 
of entangled quantum systems. An estimation step, in 
which Alice and Bob publicly announce a fraction of their 
measurement results to estimate the violation of a Bell 
inequality and the error rate in their raw data. An error- 



correction step, in which these errors are corrected using 
a classical protocol that involves public communication. 
Finally, a privacy-amplification step in which a shorter, 
secure key is distilled from the raw key based on a bound 
on the eavesdropper's information deduced from the Bell 
violation estimation. 

The first DIQKD protocol proven secure against gen- 
eral attac ks by a no-signaling eavesdropper was intro- 



duced in ( Barrett et all l2005al) . The protocol is based 
on the chained Bell inequality (f2"3"]l and produces a single 
secure key bit. It represents mostly a proof-of-principle 
result as the protocol is inefficien t and unable to toler - 
ate reasonable levels of noise. In ( Barrett et al I l2005al) 
security is proven assuming that each of the n entangled 
pair measured in the protocol is isolated from the other 
pairs. The protocol thus require that Alice and Bob have 
n separate pairs of devices, rather than a single pair of de- 
vices that they use repeatedly n times. The no-signaling 
conditions are required to hold between each of the 2n 
systems of Alice and Bob. This assumption is removed in 
(Bar rett et al l l2012h . where security is proven in the sit- 
uation where Alice and Bob have only one device each, 
which they repeatedly use. Instead of full no-signaling 
correlations among the 2n systems of Alice and Bob, the 
security is thus based on time-ordered no-signaling con- 
ditions, where no-signaling is only required from future 
inputs to previous inputs, but where later outputs can 
depend arbitrarily on previous inputs. 

Efficient an d noise-tole r ant pr o tocols have b e en in- 
troduced in (lAcm et all l2006at IScarani et al\ , |2006[ ) 
(see also ( Acfn et al. , l2006cl )) where however the se- 
curity analysis was restricted to individual attacks 
against no-signaling eavesdroppers. General security 
aga inst no-signa l ing eavesdropp e rs was later proven 
in (lHanggi et all 120101: iMasanesl 120091: iMasanes et all 
120091 ) under the assumption, as in ( Barrett et al. , 2005al) . 
that Alice and Bob use n separated pairs of devices con- 
strained by full no-signaling conditions. The question 
of whether it is possible to prove the security of an ef- 
ficient and noise-tolerant protocol in the case where Al- 
ice and Bob repeatedly use a single pair of devices con- 
strained by time-ordered no-signaling conditions is still 
open. One of the difficulty in obtaining such a result 
is related to the possibility of performing privacy am- 
plification against a nq - signa ling eavesdropper. It was 
shown in ( Hanggi et all [20091 ) that if no-signaling is im- 
posed only between Alice's device and Bob's one, but 
signaling within each device is allowed (so that out- 
puts can depend on later inputs), then privacy am- 
plification is not possible for protocols based on the 
CHS H inequality. This result was further extended in 
( Arnon- Friedman et all |2012( ) for a set of more gen- 
eral conditions, but still less restrictive than the de- 
sired time-ordere d no-signaling conditions. Recently , 
it was shown in ( Arnon- Friedman and Ta-Shmal I2012T ) 
that super-polynomial privacy amplification for protocols 
based on the chained inequality is impossible under the 
assumption of time-order no-signaling conditions. This 
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work still leaves open the question of exponential pri- 
vacy amplification for protocols based on a different Bell 
inequality or whether linear privacy amplification is pos- 
sible. 

Another line of results, concerned with security against 
eavesdroppers that are constrained by the entire quan- 
tum formalism and not only the no-s ignaling princi- 
ple, was initiated in ( Acfn et~al\ . l2007t ) . The advan- 
tage in this case is that better key rates and noise- 
resistance can be expected (as illustrated by the dif- 
ference between the randomness bounds (|59)l and (|60|) 
and that privacy amplification is possible and wel l stud- 
icd. The works (|Acin et all 120071 : iPironio et all . l2009h 
pro ved security of the CHSH-based protocol introduced 
in ( Acin et all . l2006ch against collective attacks by a 
quantum eavesdropper. This proof was extende d to a 
slightly more general setting in ( McKaguel . l2010b|) . Gen- 
eral security proofs of protocols based on arbitrary Bell 
inequalities under the assumption that the devices of Al- 
ice and Bob are memoryless (or equivalently that they 
use n non-interacting pairs of devices instead of a sin- 
gle one) were i ntrod uced in ( Hanggi and Reniierl . 120101 
M asanes et al I Html). The memory assump tion on the 
device was removed in ( Pironio et all l2012h . but secu- 
rity was only proven against quantum adversaries with 
classical-side information, a condition that is satisfied if 
the eavesdropper has only access to short-term quan- 
tum memories. The keyrates in ([Hanggi and Rennerl . 
l2010tlMasanes et all I20T1I IPironio et all\201$) are sim- 
ple expression expressed in term of single-copy bounds 
on the randomness of the form (|59|) . The general secu- 
rity of a CHSH-based protocol with no memory assump- 
tio ns on the device s or the eavesdropper was reported 
in ( Reichardt et all l2012al fbT). albeit it is polynomially 
inefficient and does not tolerate noisy devices. The secu- 
rity is obtained as a corollary of a more general strong 
testing result that allows the shared quantum state and 
operators of the two untrusted devices to be completely 
characterized. Finally, a complete DI proof of security 
of QKD that tolerates a constant noise rate and guaran- 
tees th£_generatioji_of_a linear amount of key was given 
in ( Vazirani and Vidickl [2012bh for a protocol that is a 
slight variant of Ekert's protocol. It is an open ques- 
tion whether this approach can lead to trade-offs between 
noise rate and key rate as good as the ones that have 
been shown to be achievable under additional memory 
assumptions on the devices or the eavesdropper. 

The general assumptions that underly the above proofs 
are similar to the ones for DIRNG: the validity of quan- 
tum theory or the no-signaling principle, access to a ran- 
dom seed independent of the devices and the eavesdrop- 
per, a separation assumption on the behaviour of the 
devices, and the implicit assumption that the devices do 
not leak out directly unwanted information to the eaves- 
dropper. Apart from that the devices are mostly unchar- 
acterized and no assumptions are made on the Hilbcrt 
space dimension, the specific measurements that are im- 
plemented, and so on. 



Note that in the dishonest-provider scenario where the 
devices are outright malicious and assumed to have been 
prepared by the eavesdropper, repeated implementations 
of a protocol using the same devices can render an earlier 
genera ted key insecure due to device-memory-based at- 
tacks dBarrett et all 120131) . In such attacks, untrusted 
devices may record their inputs and outputs and re- 
veal information about them via publicly announced out- 
puts during la t er im plementations of the protocol. Sec 
( Barrett et all |2013|) for a thorough discussion of the 
general scope of such attacks, including the possibilities 
of countering them by refined protocols. A countermea- 
sure relying on an encryption scheme which allows Alice 
and Bob to exchange data without the devices leaking 
information about previously generated ke ys to Eve is 
presented in ( McKague and Sheridanl [20121) . 

Finally, let us say a few words about experimental per- 
spectives for DIQKD. The implementation of a DIQKD 
protocol requires a genuine Bell violation over large dis- 
tances. Genuine, here, means with the detection loop- 
hole closed (at least if one is considering complete DI 
with no further assumptions on the devices), see subsec- 
tion [VILBTTJ Transmission losses in optical fibres, how- 
ever, represent a fundamental limitation for the realiza- 
tion of a detection-loophole free Bell test on any distance 
relevant for QKD. Approaches to circumvent the problem 
of transmission losses have been proposed based on her- 
alded qubit amplifiers ( Gisin et~al\ , 120101 Pitkanen et al. 



2011 ) and standard quantum relays (ICurtv and Moroder 



20111 ) , but an experimental demonstration still represents 
a great challenge. Improved data post-processing has also 
been proposed to inc r ease the tolerance to lost photons 
(|Ma and Lutkenhaust 120121) . 



D. Other device-independent protocols 

In a quantum experiment, the violation of a Bell in- 
equality reveals the presence of entanglement in a device- 
independent way. In fact, in some cases a much stronger 
statement can be made. Certain quantum correlations 
can only be reproduced by performing specific local mea- 
surements on a specific entangled state. Hence the ob- 
servation of such correlations allows one to character- 
ize an unknown source of quantum states, as well as 
the measurement devices, in a device-independent man- 
ner. For instance, the observation of maximal viola- 
tion of the CHSH inequality implies that the underlying 
quantum stat e is necessarily equivalent to a two-qubit 
singlet state ( Cirel'sonl . Il980() . Moreover, the measure- 
ment settings of b o th Al i ce and Bob must antic o mmut e 



( Braunstein et al I Il992t iPopescu and Rohrlichl . 
Another method, developed in ( Mayers and Yaol 



allows one to reach the same conclusion. Such procedures 
are termed self-testing of the singlet state. 

More formally, these works show the following. Con- 
sider an experiment involving a state \tp) and measure- 
ment operators M\ and M 3 B , with i, j = 1, 2. If a CHSH 
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value of S — 2\/2 is achieved, then the state is equivalent 
(up to local isometries) to a singlet state \4>-) and the 
measurement are to anti-commuting Pauli operators o\ 
for Alice with {<t 1 a ,(t a } = 2(5^1 (and similarly for Bob 
cr B ), in the sense that 

$(|V>)) = \junk)8\1)-) (62) 
§{M A M B \^)) = \junk)®a A a B \iP-) (63) 

where <f> = $a ® $b is a local isometry, and \junk) a 
state share by Alice and Bob. 

For a self-testing protocol to be practical it should be 
robust to small deviations from the ideal case, due for 
instance to experimental imperfections. A first proof 
of the r obustness of the Ma yers- Yao scheme was de- 



rived in dMagniez et al\ , 1200 61], late r con siderably simpli 



. i n rtiv . 

fiedin (|McKague and Moscall201ll) . In (jMcKague et all 
|2012| ). a framework for studying the robust self-testing 
of the singlet state was presented, which can be used 
to certify devic e-independently the entanglement frac- 
tion of a source (|Bardvn et all 120091, . More generally, it 
was sh own in the ground-breaking work ( Reichardt et all 
l2012al lbh that self-testing can be achieved in the CHSH 
scenario even if the devices feature a quantum memory. 
Loosely speaking, this means that the only way to achieve 
a violation of the CHSH inequality close to 2-^/2, is if the 
measured bipartite states are close to the tensor product 
of singlet states, and the measurements are the optimal 
CHSH measurements. 

Self-testing of other quantum states was also dis- 
cussed. In particular, the case of partially entangled 
bipar tite states was addressed in ( Yang and Navascuesl 
|20 12h . In the multiparti te setting , the case of 
graph states was dis cussed ( McKagua l2010aj) . while 
([Miller andlShl |2012| ) considered self-testing in XOR 
games. Also, the device-independent certi fication of "en- 
tangled measurements" was invest igated ( Rabelo et all 
l201ll ; IVertesi and Navascuesl 1201 ll) . 

Moving away from self-testing, an interesting develop- 
ment is the device-independent assessment of multipar- 
tite quantum entanglement. Notably, techniques for de- 
vising device-inde pendent witnesses of g enuine multipar- 
tite entang lement (|Bancal et "al. l. l2011bTl were developed. 
Moreover, ( Brunner et aZ.I . 2012h discussed how the struc- 
ture of multipartite entangled states can be characterized 
using Bell inequalities. 



V. INFORMATION-THEORETIC PERSPECTIVE ON 
NONLOCALITY 

As we have seen in the previous section, nonlocality 
can be seen as a resource for information processing and 
communication tasks and the no-signaling principle plays 
a fundamental role in this respect. We have also seen 
in Section [XT] that there exist no-signaling correlations 
that are more no nlocal than those of quantum theory, 
as pointed out in ( Popescu and Rohrlichl Il994l ) . If Ali 



and Bob had access to such PR-boxes they could imple- 
ment many of the protocols discussed earlier, from com- 
munication complexity to cryptography, often with much 
higher efficiency than what quantum correlations allow 
van Daml . l2005h . Mo-signaling non-local correlations can 



thus be viewed as i nformation-theoretic r esources and in- 
vestigated as such ( Barrett et all l2005b| ) . This new per- 
spective raises two general questions: Can we develop a 
resource theory of non-locality, similar to the resource 
theory of entanglement? What distinguishes quantum 
correlations from more general no-signaling correlations 
in this information-theoretic context? To answer them 
it is first useful to identify the physical properties which 
are generic to all no-signaling non-local theories. 



A. Properties of no-signaling correlations 

Remarkably, it turns out that many features of quan- 
tum mechanics, usually thought as counter-intuitive and 
genuinely quantum, are in fact general features of an 
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no-si gnaling theory featuring nonlocality ( Ma sanes et all 
2006). These include a no-cloning theorem, the 
monogamy of correlations, a disturbance versus informa- 
tion gain trade-off in measurements, the inherent ran- 
domness of measurement outcomes, the complementarity 
of measurements and uncertainty relations. These phys- 
ical properties are clearly relevant from an information- 
theoretic point of view, think for instance about the role 
that the no-cloning theorem or the monogamy of entan- 
glement play in quantum information science. The fact 
that such properties are generic to all no-signaling nonlo- 
cal theories thus already suggests that such theories offers 
interesting possibilities for information processing. 

We have already given the intuition in subsection llV.CI 
of why measurement outcomes must be random in any 
non-local no-signaling theory. Let us illustrate some of 
the other above properties with simple examples based on 
Popescu-Rohrlich type correlations. Consider that Alice 
and Bob share a PR-box (|36|) . i.e. correlations of the 
form 



/ 71 x I 2 a®b = xy 
p(ab\xy) = 

otherwise 



(64) 



where © is addition modulo 2, and x,y £ {0, 1} denote 
the inputs and a,b £ {0, 1} the outputs. The impossi- 
bility of having a perfect cloning machine is here easily 
derived by contradiction. Assume such a machine ex- 
ists. Then Bob could apply it to its subsystem, resulting 
in a tripartite probability distribution p(a, 61, 62 \x, y\, y 2 ) 
satisfying the relations 



a © bi = xyi , a © b 2 — xy 2 



(65) 



with a, b\,b 2 locally uniformly distributed. Combining 
the above relations leads to 



ice 



h © b 2 = x( Vl © 2/2) 



(66) 
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showing that Bob's marginal probability distribution di- 
rectly depends on x, the input of Alice, when Bob uses 
inputs such that y\ © 7/2 = 1. Thus, Alice can signal 
to Bob, which contradicts our basic hypothesis that the 
theory is non-signaling. Therefore, we conclude that a 
perfect cloning machine cannot exist in a theory featur- 
ing PR-box c orrelations. General and rigorous pr oofs can 
be found in (jBarretti l2007t iMasanes et all 120061 ) . 

The above simple example also indicates that no- 
signaling correlations are constraint by monogamy re- 
lations (see Section |VLC|) . In particular, a PR-box be- 
ing an extremal point of the no- signaling set must be 
decoupled from any other system ( Barrett et all l20"05bl 
IMasanes e*"aEl2006t ). 

As a last example, let us illustrate the existence of a no- 
tion of complementarity of measurements in generalized 
non-signaling theories. Considering again PR-box cor- 
relations, the two possible measurements on Bob's side 
(corresponding to y = and y = 1) cannot be compati- 
ble, that is, there cannot be a single joint measurement Y 
returning outcomes 60 and 61 corresponding respectively 
to y = and y = 1. Indeed, this would imply the exis- 
tence of a distribution P(ab bi\xY) satisfying bo ®b\ = x 
(since a © 60 = and a © b\ = x), thus violating no- 
signaling as in the above example. 



B. Nonlocality measures, interconversion and distillation 

If nonlocal boxes can be viewed as a information- 
theoretic resource, can we define a theoretical framework, 
analogous e.g. to the framework that has been developed 
for the study of entanglement, that would allow us to 
answer unambiguously questions such as, can two given 
sets of nonlocal correlations be considered equivalent re- 
sources, or, what is a good measure of nonlocality? 

A prerequisite for addressing these issues is to under- 
stand interconversion between nonlocal boxes, that is, 
the simulation of a given nonlocal box using a supply of 
other nonlocal boxes. In this context, separated parties 
are allowed to perform local operations on their boxes. 
They can relabel the inputs and outputs, and also 'wire' 
several boxes, using for instance the output of one box 
as an input for another box. Importantly, classical com- 
munication is not allowed, as it represents a nonlocal re- 
source, which allows trivially for the simulation of any 
nonlocal box. 

The i nter-conversion of bipa r tite boxes has bee n stud- 



ied in dBarrett et all l2005bl iForster and Wolfl 1201 it 



I Jones and Masanesl . 2005 ) and is by now relatively well 
understood. The main conclusion to be drawn from 
these works is that the PR box represents a good unit 
of bipartite nonlocality — much like the singlet in the 
case of entanglement — in the sense that any bipartite 
no-signaling box can be simulated to an arbitrary pre- 
cision using a supply of PR boxes ( Forster and WolA 
l201ll ). In the multipartite case, the situation is more 
complicated. On the one hand several classes of ex- 



tremal nonl ocal boxes ca n be simulatea exactly using 
PR boxes ([Barrett et all l2005bl iBarrett and Pironiol 
120051) . On the other hand there exist non-signaling boxes 
which can provably not be approxi mated using an arbi- 
traril y large supply of PR boxes ( Barrett and Pironiol 
120051 ; iPironio et all |201 lh . In particular, there ex- 
i st quantum non-local cor relations with this property 
(jBarrett and Pironiol [20051 ). It is still an open question 
whether there exists a unit of multipartite nonlocality; in 
fact even proposing a good candidate is challenging given 
the complexity of the set of multipartite non-signaling 
correlations (see Section Hl.Dp . 

Another relevant issue is whether nonlocality can be 
distilled. That is, from a supply of weakly nonlocal 
boxes is it possible to obtain via local operations (i.e. 
relabelings and wirings) one copy of a box featuring 
more nonlocality, in the sense that it violates more a 
given Bell inequality than the original boxes. Inter- 
estingly, nonlocality distill ation is poss i ble fo r certain 
classes of nonlocal boxes ( Forster et al I l2009h . More- 
over, maximally nonlocal PR box correlations can be 
distilled out of certain boxes with arbit rarily weak non- 
locality ( Brunner and Skrzypczvkl l2009f) . i.e. violating a 
Bell inequality by an arbitrarily small amount. The ex- 
istence of such distillation protocols has important con- 
sequences from an information-theoretic point of view. 
For instance, if a certain class of boxes can be distilled 
to a PR box, then all boxes in this class inherit the 
information-theoretic power of the PR box. Note also 
a series of negative results, concerning in particular the 
impossibility of distilling isotropic nonlocal correlations. 
Such correlations — mixtures of PR box and white noise — 
are of particular importance, since any nonlocal box can 
be 'depolari zed' to an isot r opic w ithout decreasing its 
nonlocality (IMasanes et all 20061). Partial no-go theo- 
rems have b e en de rived (jDukaric and Wolil2008lrForsterl . 
120111 ; IShortl 120091) . but a full proof is still missing. 

These developments have opened novel possibilities for 
defining natural mea sures of nonlocality, such as the 'dis- 
tillab le nonlocality' ( Brunner et all [201 it IForster et all 
I2009D of a nonlocal box, the maximal amount of non- 
locality that can be extracted from an arbitrarily large 
supply of such boxes. First steps towards establishing a 
more genera l resource theory of nonlocality have recentl y 
been taken ( Brunner et al l l201lllGallego et al. I. l2012bh . 

Finally, it is interesting to look for sets of correlations 
which are invariant under local operations. A set is said 
to be closed under wirings if, by combining correlations 
of this set via local operations, it is impossible to gener- 
ate correlations outside the set . The study of such sets 
was initiated in ( Allcock et all l2009al ) . Clearly the sets 



of local, quantum, and no-signaling correlations are all 
closed under wirings. Finding other closed sets appears 
to be a nontrivial problem. An interesting open problem 
is whether there exist, in the CHSH scenario, a strict 
subset of the no-signaling polytope, that is closed un- 
der wirings and features more nonlocality than quantum 
mechanics (i.e. violating Tsirelson's bound). 
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C. Consequences of superstrong non-locality 

The existence of no-signaling correlations stronger 
than quantum mechanical ones raises fundamental ques- 
tions. Why is nonlocality limited in quantum theory? 
Would there be unlikely consequences from a physical or 
information-theoretical point of view if supra-quantum 
correlations were available? Can we identify reason- 
able principles that allow us to characterize the bound- 
ary that separate quantum from supra-quantum correla- 
tions? The works discussed below address such questions. 
The first sections below deal with more information- 
theoretic consequences of supra-quantum nonlocality and 
the second one is based on more physical concepts. 



1. Information-theoretic consequences 

a. Communication complexity and nonlocal computation. 
A first result showing a sharp difference between quan- 
tum and super-quantum correlations in their capability 
of performing in formation-theoretic tasks was given in 
van Dad . 120051) n the context of communication com- 



plexity. As discussed in subsection IIV. Al communication 
complexity deals with the problem of determining the 
number of bits that Alice and Bob need to exchange to 
compute the value f(x,y) of a function whose inputs x 
and y are distributed among Alice and Bob. The amount 
of communication that is required depends on the par- 
ticular function / and the resources that are available 
to Alice and Bob. Restricting to binary (also called 
boolean) functions f(x,y) : {0,1}™ x {0,1}" -> {0,1} 
taking n-bit strings X — X\ • • • X fi and y = y\ . . . y n as 
inputs, it is known that some of such functions have high 
communication complexity, basically Alice must send her 
entire bit string x to Bob, even if Alice and Bob are 
allowed to share unlimited prior entanglement. An ex- 
product 

a nil). 

trast, if unlimited PR-boxes were available to Alice and 
Bob, then a single bit of classical communication from 
Alice to Bob is sufficient for Bob to evaluate any binary 
function, that is, communication complexity collapses. 

Consider again the inner product function. Suppose 
that Alice and Bob share n PR boxes and receive inputs 
x and y. In each box i, they input n and yi respectively, 
and get outcomes ai and 6, satisfying a; Obi = xiyi. The 
inner product function can be rewritten as 

f(x, y) = ^ x *yi = /, a i®bj= y^a» © y^frj . 



ample of such a function i s the inner produc t function 
f(x,y) = x ■ y = Yl<i x iyi ( Cleve et all Il998 l). In con- 



.1/ 



ice s side 



Bob' 



Thus Alice can compute locally c = Yli a i> ano - sen d 
the single bit c to Bob who then outputs c © b, where 
b = ^ bi, which is indeed the inner product. The inner 
product function is of particular importance, since any 
binary function / can be decomposed into inner prod- 
ucts, from which the result of van Dam follows. 



This idea was later generalized to the context of prob- 
abilistic communication complexity where Alice and Bob 
must compute / with a minimum probability of success 
([Brassard et ad . l2006l ). It was shown that certain noisy 
PR boxes, with CHSH value S > 4^/2/3 3.266 make 
communication complexity trivial in this scenario. Fi- 
nally, using nonlocality distillation, it can be shown that 
(non-quantum) boxes with an arbitrarily small amount 
of nonlocali ty can nevertheless collapse com munication 
comp lexity ( Brunner and Skrzypczvkl . l2009h . 

In ( Linden et all I2007T ) , a task closely related to com- 
munication complexity, termed nonlocal computation, 
was introduced. The binary function / that Alice and 
Bob must compute has the special form f(x,y) = g(x © 
y) = g{z) where g{z) is a boolean function taking as in- 
put an n-bit string z (with z% = x% © yi and Xi uniform 
for i — l,..,n). Thus each party has locally no infor- 
mation about the functions input z. Alice and Bob are 
asked to output one bit, respectively a and b, such that 
a©6 = f(x,y) = g(z). The figure of merit is then the av- 
erage success probability of Alice and Bob. While strate- 
gies based on quantum correlations offer no advantage 
over classical ones for the nonlocal computation of an ar- 
bitrary function, it turns out that c ertain super-quantum 
correlations provide an advantage ( Linden et all [20071 ) . 
Remarkably, if one consider as a function the nonlocal 
AND of two bits g{z\, z^) = Z\z%, then the limit at which 
noisy PR-boxes stop providing an advantage over clas- 
sical and quantum correlations corresponds exactly to 
Tsirelson's bound. Note, however, that when the dis- 
tribution of inputs is not perfectly uniform, i.e. when 
Alice and Bob have partial knowledge (even arbitrarily 
small) about the function's input z, quant um correlations 
provid e an advantage over classical ones ( Allcock et all 
l2009ch . 



b. Information-causality. Suppose Alice sends an m-bit 
message to Bob. How much information is potentially 
available to Bob? A natural guess is that the amount of 
information potentially available to Bob is equal to what 
he receives, that is, m bits. This is in essence the prin- 
ciple of information causality: the amount of informa- 
tion potentially available to Bob about Alice's data is not 
higher than the amount of information Alice sends to him 
( Pawlowski et al .. 2009). While both classical and quan- 
tum physics satisfy information causality, this is in not 
the case in general, if supra-quantum correlations were 
available. Hence information causality can be viewed a 
strengthening of the no-signaling principle. 

To see how super-quantum correlations can violate 
information-causality, suppose that Alice is given two 
classical bits xq and x±, each uniformly distributed. Bob 
is interested in learning one of these two bits, but Alice 
does not know which one. To make her life complicated 
she is allowed to send only one bit to Bob. Can they 
devise a protocol such that Bob can always retrieve the 
desired bit? In a scenario where Alice and Bob share only 
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classical or quantum resources, the answer is no. How- 
ever, if Alice and Bob share PR boxes , the task becomes 
possible (jWolf and WullschlegerLl2005l ). Alice first inputs 
xq © x\ in her end of the PR box, and gets outcome a. 
She then sends the 1-bit message to Bob: to = a © xq. 
Bob, who is interested in bit Xk of Alice, inputs k on his 
end of the PR box, and gets outcome b. Upon receiving 
Alice's message to, Bob makes his guess G = 6© to = Xk- 
Hence, Bob's guess is always correct. 

The principle of information causality allows to 
recover part of the boundar y between quantum 
and super-quantum co rrelations ( Allcock et all l2009bt 
iPawlowski et al. 1. 120091 ). Notably, any theory that allows 
for the violation of Tsirelson's bound violates information 
causality. 



c. Limitations on multipartite correlations. The princi- 
ples discussed above focus on bipartite correlations. A 
non-local game t ermed guess your ne ighbour's input 
was introduced in ([Almeida et all l2010al) . which reveals 
an intriguing separation between quantum and super- 
quantum correlations in a multipartite context. Consider 
n distant parties placed on a ring. Each party i is given 
an input bit Xi according to a joint prior probability dis- 
tribution p{x\ . . . x n ). As the name of the game suggests, 
each party is then asked to give a guess of his right 
neighbor's input, i.e. such that a t = a^+i for all i = l...n. 
Since a high probability of success at this game would 
lead to signaling, it is not surprising that quantum re- 
sources provide no advantage over classical ones, for any 
distribution of the inputs. However, it turns out that 
certain no-signaling super-quantum correlations outper- 
form classical and quantum strategies for certain distri- 
butions of the inputs. Remarkably, some of these games 
correspond to facet Bell inequalities. Hence 'Guess your 
neighbour's input' identifies a portion of the boundary of 
the quantum set which is of maximal dimension. This 
quite innocuous game has several rather surprising ap- 
plica tions, related to generalizations of Gle ason's theo- 
rem (|Acm et all l2010t IBarnum et all l2010h and to un- 



201 lh . 



extendible product basis (jAugusiak et all 

The motivation for many of the results discussed above 
is to identify general properties or a set of principles 
that potential l y sing le out quantum correlations. In 
dGallego et all 120111) . it was shown that any such prin- 
ciples must be genuinely multipartite. More specifi- 
cally, there exist tripartite super-quantum correlations 
which arc local among every possible bipartition (even if 
many co pies of them a r e ava i lable and w i rings are per- 
formed) (|Gallego et all l201lt lYang et all I2012D . Thus, 
no bipartite principle can ever rule out these correla- 
tions. Such super-quantum correlations can nevertheless 
be ruled out by a n o vel p rinciple termed 'local orthog- 
onality' ( Fritz et~al\ , |2012| ). inspired from the game of 
'Guess your neighbour's input'. 



2. Physical consequences 

a. Macroscopic locality. Loosely speaking, macroscopic 
locality is a principle requiring that nonlocal correlations 
admit a classical limit. More specifically, in a Bell test 
involving a large number of pairs of particles, the statis- 
tics of coarse-grained measurements (not resolving dis- 
crete particles) should admit an explanation in terms of 
a local model, i.e. should no t viola te any Bell inequality 
(jNavascues and Wunderlich . 20091). Th is is the case in 
quantum mechanics ( Bancal et all\206§i) , but not in gen- 
eral no-signaling theories. Notably, the set of correlations 
satisfying macroscopic locality can be completely charac- 
terized. It corresponds to the set Q\, the first approx- 
imation to the set of quantum correlations in the hier - 
archy of semi-definite programs (jNavascues et al I I2007D 
discussed in subsection Ill.C.l.dl This set is however 
strictly larger than the quantum set. Thus, there are 
super-quantum correlations that still sat i sfy m acroscopic 
locality. It was shown in ( Yang et all 1 2 llh that an- 
alytical quantum Bell inequalities can be derived from 
macroscopic locality. Finally, note that there exist corre- 
lations satisfying macroscopic locality which neverthe less 
violate information causality ( Cavalcanti et all\201(t) . 



b. Uncertainty and information. In ( Wehner et all . 120081 ) 



it was shown that one can reformulate any Bell inequality 
in the language of information, which for projection non- 
local games (see Section [H.B.4.c[) works as follows. For 
every question x and answer a of Alice, one can write 
down a string s XjQ = (s£} a , . . . , S£$) where s^i = b is 
the answer that Bob must return for question y in order 
for them to win the game. Written in this way, one may 
think of the state of Bob's system conditioned on Alice 
measuring x and obtaining outcome a as an encoding of 

(v) 

the string s x . g from which Bob must retriev e entry Sx, a 
correctly. In ( Qppenheim and Wehnerl . l2010f ) it was fur- 
thermore shown that for any physical theory uncertainty 
relations can be understood as imposing limits on how 
well we can retrieve information from an encoding. This 
information-theoretic perspective is the essential idea be- 
hind the relation between nonloca l ity an d uncertainty 
found in ( Qppenheim and Wehnerl . l2010h . which holds 
for any physical theory. It should be noted that the aim 
of ( Qppenheim and Wehnerl . |2010T ) is not to derive lim- 
its on nonlocality by appealing to intuitive notions on 
how we expect information to behave, but rather to link 
it to another concept already existing within quantum 
mechanics. 



c. Local quantum mechanics. In ( Acin et all l2010t 
IBarnum et~al\ . l2010h it was shown that if stronger than 
quantum correlations were observed, then quantum me- 
chanics cannot hold even locally. More precisely, it was 
shown that if the no-signaling principle holds, and Alice 
and Bob are locally quantum, then all possible corre- 
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lations between them admit a quantum mechanical de- 
scription. It should be noted that the two conditions of 
no-signaling and local quantum mechanics ar e insufficient 
to re strict tripartite correlations in this way ( Acm et all 
l2010h . 



D. Non-locality in generalized probabilistic theories 



of nonlocal correlations depends strongly on the geom- 
etry. As the amount of uncertainty in a theory bounds 
the geometr y of the state space, this provid es a very nice 
insight into ( Qppenheim and Wehnerl . l201dh . An intrigu- 



ing consequence of this result is there exist models that 
are locally almost indistinguishable from quantum me- 
chanics, but can nevertheless generate maximally nonlo- 
cal correlations. 



The idea of investigating the information-theoretic 
power of non-local correlations mor e general than quan- 
tum on e s led , following work of dBarrettl . 120071) and 
( Hardvl . l200ll ). to a very active line of research in 
which information processing has been considered in the 
broader framework of "general probabilistic theories" 
(GPT) or "convex-operational" formalism. This frame- 
work allows one to define full-fledged theories (i.e. that 
include notions of states, evolution, measurements and 
not only "correlations" ) in which classical and quantum 
theories are merely two special cases. Given such a for- 
malism, one can compare and contrast quantum theory 
with other alternative theoretical models. The hope is 
to better understand quantum theory and identify in 
what ways it is special. To date, much work has fo- 
cused on information processing in GPT, investigating 
for instance cloning, broadcasting, teleportation, or en- 
tanglement swapping. Even if these works connect and 
partly overlap with many of the issues mentioned above, 
we do not review this very fruitful work here as it does 
not directl y take nonlocality as a star ting point. We refer 
instead to ( Barnum and Wilcel . 12012]) for a short review. 
In what follows, we only mention work that explicitly 
consider Bell nonlocality in the context of GPT. 

In (jSteeg and Wehnerl . I2009T) . it was shown that super- 
strong random access encodings exist in certain matrix 
based theories that violate the CHSH inequality beyond 
Tsirelson's bound. A quantum random access code is 
an encoding of an n-bit string x = x\, . . . ,x n <E {0, 1}™ 
into a quantum state p x € such that each bit 

Xj can be retrie ved from p x with some probability pj. 



In (|Navakl li"999Tl it was shown that if the state has di- 
mension at most dim('H) = d, then the success prob- 
abilities are bounded as $^ x (l — h(pj)) < logd, where 



h{p) = -ylozn p - (1 - p) lo 
tropy. In (jSteeg and Wehne 



p) is the binary en- 
it was shown that 
this inequality can be violated for some theories that 
allow stronger than quantum correlations, i.e., super- 
strong random access encodings exist in such theories. In 
particular, there exists generalized "states" in a Hilbert 
space of dimension d which effectively contain more than 
d bits of information. 

In ( Janotta et all l201ll ) it is shown that there is a con- 
nection between the strength of nonlocal correlations in 
a physical theory and the structure of the state spaces of 
individual systems. In particular, a class of GPTs is pre- 
sented that allows to study the transition between classi- 
cal, quantum and super-quantum correlations by varying 
only the local state space. It is shown that the strength 



VI. MULTIPARTITE NONLOCALITY 

In the multipartite case, nonlocality displays a much 
richer and more complex structure compared to the case 
of two parties. This makes the study and the character- 
ization of multipartite nonlocal correlations an interest- 
ing, but challenging problem. It comes thus to no surprise 
that our understanding of nonlocality in the multipartite 
setting is much less advanced than in the bipartite case. 

The study of multipartite n onlocality was ini tiated by 
the ground-breaking work of ( Svetlichnvl Il987l ) . In this 
paper, the author introduced the concept of genuine mul- 
tipartite nonlocality, derived a Bell-type inequality for 
testing it, and showed that that this strong form of non- 
locality occurs in quantum mechanics. Later, in particu- 
lar with the advent of quantum information science, the 
concepts and tools introduced by Svctlichny were further 
developed. 

In this section, we will start by defining various no- 
tions of multipartite nonlocality (with a particular focus 
on genuine multipartite nonlocality) and discuss the de- 
tection of multipartite nonlocality. Next, we discuss the 
notion of monogamy of nonlocality, which limits nonlo- 
cality between different subsets of parties. Finally, we 
discuss the nonlocality of multipartite quantum systems. 



A. Defining multipartite nonlocality 

In the case of two separated observers, the notion of 
nonlocality is unambiguously described using the defini- 
tion of Bell (see Section [XJ) . In the case of three or more 
separated observers, the situation becomes more com- 
plex, and there exist several inequivalent notions of non- 
locality. For instance, considering n separated observers, 
it could be that (i) a subset of k < n parties are non- 
localy correlated, while the remaining n — k parties are 
only classically correlated, or (ii) that all n parties are 
nonlocally correlated. Clearly the latter notion of non- 
locality is stronger than the former. This is referred to 
as genuine multipartite nonlocality, which represents the 
strongest form of multipartite nonlocality. The main pur- 
pose of this section is to discuss the problem of defining 
formally, in the spirit of Bell's definition, this concept of 
genuine multipartite nonlocality. 
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1. Genuine multipartite nonlocality: Svetlichny's approach 



2. Beyond Svetlichny's model 



Svctlichny proposed the first definition of genuine mul- 
tipartite nonlocality. For simplicity and clarity, we will 
describe here the case of three separated parties, Alice, 
Bob and Charlie. The measurement settings are denoted 
x, y, z and their outputs by a, &, c for Alice, Bob, Charlie 
respectively. The experiment is thus characterized by the 
joint probability distribution p(abc\xyz). As mentioned 
above, there exist different notions of nonlocality which 
the correlations p{abc\xyz) can exhibit. 

First, the experiment can display 'standard' nonlocal 
correlations, that is, the probability distribution p cannot 
be written under the local form: 

pi{abc) = / d\p(\)px{a\x)p\{b\y)p\(c\z) (67) 



where A is a shared local random variable and J d\p(X) = 
1. This represents the natural generalization of Bell's lo- 
cality condition (j3|) to the multipartite case. However, 
this notion of nonlocality does not capture the idea of 
genuine multipartite nonlocality. For instance, in the 
case where Alice and Bob are non-locally correlated, 
but uncorrelated from Charlie, it would still follow that 
p(abc\xyz) cannot be written in the form (|67[) . although 
the system clearly features no genuine tripartite nonlo- 
cality. 

To detect genuine multipartite nonlocality, one needs 
to ensure that the probability distributions cannot be re- 
produced by local means even if (any) two of the three 
parties would come together and act jointly — and con- 
sequently could reproduce any bipartite nonlocal prob- 
ability distribution between them (even possibly featur- 
ing signaling). Formally, this amounts to ensure that 
p(abc\xyz) cannot be written in the form: 



p s (abc\xyz) = / d\p(\)p x (ab\xy)p\(c\z) 

dpp(p-)p f ,(bc\yz)p ll (a\x) (68) 
+ / dvp(v)p v (ac\xz)p v (b\y) 



where J d\p(X) + J dpp(p) + J dvp(v) = 1. A probability 
distribution p(abc\xyz) which cannot be expressed in the 
above form features genuine tripartite nonlocality in the 
sense of Svetlichny; to be reproduced by classical means, 
all three parties must come together. Detecting such a 
form of multipartite nonlocality is obviously an impor- 
tant issue. Standard Bell inequalities can in general not 
be used here, hence better adapted tools are needed (see 
Section |VLB|- 

While Svetlichny's notion of genuine multipartite non- 
locality is often referred to in the literature, it has certain 
drawbacks which will be discussed below, as well as al- 
ternative approaches addressing these issues. 



In Svetlichny's definition of genuine multipartite non- 
locality, when several parties group, they are allowed to 
display arbitrary correlations. In particular, this includes 
signaling probability distributions. For instance, consid- 
ering again the above tripartite example, the bipartite 
probability distributions, e.g. p\(ab\xy), entering decom- 
position are unconstrained, apart from normaliza- 
tion. In particular, this means that we have not imposed 
the no-signaling constraints: 



P\(a\xy) 
P\(b\xy) 



P\{a\xy') 
P\(b\x'y) 



Va,x,y,y' 
Vb,x,x',y 



(69) 
(70) 



where p\ (a | xy) = ~Y^ b p\{ab\xy) is Alice's marginal prob- 
ability disitribution, and similarly for Bob. These condi- 
tions impose that Alice cannot send a message to Bob by 
choosing her measurement setting, and vice versa. Prob- 
ablility distribution failing to satisfy (at least) one of the 
above constraints lead to signaling. Signaling from Alice 
to Bob occurs when condition ((70]) is not satisfied. Simi- 
larly, signaling from Bob to Alice occurs when (|6"9"]) is not 
satisfied. 

In Svetlichny's definition (|68|) . it is possible to con- 
sider probability disitrbutions p\(ab\xy) which signal 
both ways, i.e. which violate both conditions (|69[) and 
(|70|) . While this is fine from a formal point of view, 
such two-way signaling probability distributions are in- 
consistent from a physical po int of view since they lead to 
grandfather- type paradoxes dBarrett et al . 201ll ) as we ll 
as from an operational viewpoint ( Gallego et all l2012bl) . 

This suggests an alternative definition of genuine mul- 
tipartite nonlocality based on time-ordering, thus pre- 
venting from two-way signaling. Basically, building upon 
Svetlichny's defintion, one now additionally demands 
that all bipartite correlations, e.g. p\(ab\xy), are time- 
ordered. The set of 2|1 time-ordered local correlations 
(6>™) contains all distributions that can be written in3 
the form 



p{abc\xyz) = 



d\p{\)pl AB (ab\xy) Px (c\. 



d^p{p)p'l AC {ac\xz)p p ,(b\y) (71) 



dvp(v)p v BC (bc\yz)p„(a\x) 



= p A < B (ab\xy); 
we have that 



where p^ AB (ab\xy) denotes a probability distribution 
that is time-order dependent: when Alice measure be- 
fore Bob, we have that p^ AB (ab\xy) 
when Bob measures before Alice, 
p r ^ AB {ab\xy) = p B<A (ab\xy). Indeed consistency implies 
that p A<B (ab\xy) and p B<A (ab\xy) are both (at most) 
1-way signaling; p A<B \ab\xy) is such that only Alice can 
signal to Bob, while p B<A (ab\xy) is such that only Bob 
can signal to Alice. 
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A probability disitrbution p{abc\xyz) that cannot be 
written in the form (|7ip is genuine multipartite nonlo- 
cal. Also, it turns out that the above definition (fTTj) is 
inequivalent to that of Svetlichny (|68p . In particular, one 
has that ^2j\ ^— t ^2|i e ^ - 

More generally it was also suggested to c onsider mod- 
els w ith arbitrary communication patterns ( Jones et all 
120051 ) . Multipartite nonlocality can then be quantified by 
looking for the model using the smallest amount of com- 
munication (for a given pattern). It is also possible to 
consid er the case in which certain parties broadcast their 
input (|Bancal et all l2009h . Then the minimal number 
of broadcasting parties needed for reproducing certain 
correlations gives a measure of the nonlocality these cor- 
relations. 



where C denotes the set of fully local correlations defined 
by flST]). 



B. Detecting genuine multipartite nonlocality 

After having defined the concept of genuine multipar- 
tite nonlocality, it is natural to ask how one can detect 
this strong form of multipartite nonlocality. Beyond the 
fundamental interest, such a question finds a natural con- 
text in quantum information theory and in the study of 
many-body systems. First, the presence of genuine mul- 
tipartite non-locality implies the presence of genuine mul- 
tipartite entanglement in a device-independent way. Also 
it is a fundamental issue to determine the role played by 
nonlocality in quantum information processing. 



3. Approach based on no-signaling 

A simpler (but more stringent) definition of genuine 
multi partite nonlocal i ty is ba sed on the no-signaling prin- 
ciple (lAlmeida et all l2010bl ). Such approach is rather 
natural for the study of quantum correlations, which 
are inherently no-signaling. In this approach, one im- 
poses that all parties, even when they come together, 
still display non-signaling correlations. This approach 
ca n be seen as a multi partite generalization of the ideas 
of (|Elitzur et ali Il992h (see Section [03213]) • 

Consider a tripartite distribution P(abc\xyz), and de- 
compose it in the following way 



p(abc\xyz) = qipf' B:C + qA-.BCP'i 



A-.BC j_ „ „C:AB 
+ qC:ABPi 



iB-.ACPl 



B-.AC 



QnsPn 



(72) 



whci'C qi + q A -.B:C + qA-.BC + qAB-.C + qB-.AC + qn 



Here pj 



A-.B-.C 



J dXp(X)p(a\x, X)p(b\y, X)p(c\z, A) is fully 



local, and the distributions p l ^ k are no-signaling bi-local. 
For instance, pf' BC = J dXp(X)p A (a\x, X)p BC (bc\yz, A), 
where p BC (bc\yz 7 A) is no-signaling; similarly for pf ' AC 
and pf ' AB . The distribution p{abc\xyz) is then said to 
be genuine multipartite nonlocal when q ns > for any 
possible decomposition of the form (fT2"j) . The set of corre- 
lations which are not displaying this form of nonlocality, 
i.e. admit a decomposition (|72|) with q ns = is denoted 
S^\. Moreover, p(abc\xyz) is said to be fully genuine 
multipartite nonlocal when q ns = 1 for any possible de - 
composition of the form (fT2"j) . In ( Almeida et all [201 Obi) , 
a criterion to detect the latter form of nonlocality has 
been given. 

Finally, it is interesting to compare all three defini- 
tions of genuine multipartite nonlocality introduced in 
this section. It turns o ut that none o f them a re equivalent 
([Barrett et all |201ll; iGallego et al. I, l2012bf ). Thus the 
sets of correlations (fulfilling each definition) are strict 
subsets of each other, according to the following hierar- 
chy 



oris Qto cSvet 
C_ D 2 |i <- <->2|l C D 2|l 



(73) 



1. Svetlichny's inequality 

The first method to det ect genuine multip artite non- 
locality was presented in ( Svetlichnvl 119871). Focusing 



on a tripartite system, the author derived a Bell-type in- 
equality which holds for any distribution of the form (|68p . 
Thus a violation of such inequality implies the presence of 
genuine tripartite nonlocality, implying in turn the pres- 
ence of genuine tripartite entanglement. 

Let us focus on the case where each party j performs 
one out of two possible measurements denoted Xj and 
Xj. All measurements are dichotomic, hence their results 
are denoted by aj = ±1 and a'j = ±1. Svctlichny then 
proved that the inequality 

53 — aia2a' 3 + a\a 2 a^ + a\a2ai — a^a^a'^ (74) 
a^a^a^ + a'j^a^ + aia 2 a 3 — aia2(i3 < 4 



holds for any probability distribution of the form 
Note that the above polynomial should be understood as 
a sum of expectation values; for instance ai02a 3 stands 
for the expectation value of the product of the measure- 
ment outcomes when the measurements are xi, 22, and 
x' 3 . 

To get more intuition about Svetlichny's inequality, 
and to prove that its violation implies the presence of 
genuine multip artite nonlocality, we follow the simple ap- 
proach of Ref . ( Bancal et all l2011al ) . We first rewrite the 
inequality as follows: 



S 3 



Sa'+S'a 3 < 4 



(75) 



where 5* = aia 2 + a\a' 2 + a[a 2 — a[a' 2 is the CHSH ex- 
pression, and S' = a[a 2 + a[a 2 + a\a' 2 — a\a 2 one of 
its equivalent forms obtained by permuting primed and 
non-primed measurements. Now observe that it is the 
input setting of Charlie that defines which version of the 
CHSH game Alice and Bob are playing. When C gets 
the input x' 3 , then AB play the standard CHSH game; 
when C gets the input 23, AB play its symmetry. Hence 
it follows that S3 < 4 holds for any bipartition model of 
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the form (|68|) . Consider the bipartition A/BC. B knows 
which version of the CHSH game he is supposed to play 
with A, since he is together with C. However, CHSH be- 
ing a nonlocal game, AB cannot achieve better than the 
local bound (i.e. S = 2 or S' = 2), as they are separated. 
Thus it follows that S3 < 4 for the bipartition A/BC. 
Note that the same reasoning holds for the bipartition 
B/AC. Finally, since the polynomial is symmetric under 
permutation of the parties, it follows that S3 < 4 for 
all bipartitions. The inequality ([74]) detects the genuine 
multipartite nonlocality of important classes of quantum 
states, such as GHZ and W (see Section fVLD I) . 



2. Generalizations to more parties, measurements and 
dimensions 

Svetlichny's inequality has been generalized to a 
scenario featu r ing an arbitrary number of parties n 
([Collins et all l2002bl; ISeevinck and Svetlichnvl |2002l) . 
By repeating the procedure which allowed us to build 
Svetlichny's inequality from CHSH (see (|75p) we get 



S n — Sn—i a n + S n _i cl„ < 2 



(76) 



where S' n _ 1 is obtained from S n -i by applying t he map - 
ping ai — > a' x and a\ — > at ( Bancal et all l2011al) . 
Note also that generalizations to the most general sce- 
nario, featuring an arbitrary number of parties, measure- 
ments and systems o f arbitrary dimensions, was derived 
dBancal et all l2011al) . 

Finally, note that Bell inequalities detecting notions of 
genuine multipartite nonlocality more refined than that 
of Sveltic hny (see Sections IVI A.2I and IVI.A.3|) were pre- 
sented in ( Barrett et oD. l201ll> . 



C. Monogamy 

The monogamy of nonlocal correlations is nicely illus- 
trated by considering the CHSH inequality in a tripar- 
tite scenario. Let Alice, Bob, and Charlie have two pos- 
sible dichotomic measurements, represented by observ- 
ables {A x } x , {B y } y and {C z } z with x,y,z € {0, 1}. We 
can now evaluate the CHSH expression for Alice-Bob, 
and Alice-Charlie. Denote by Bab ancl Bag denote the 
corresponding Bell operators for the CHSH inequality as 
defined in Section IH.C.l.al It is important to note that 
Alice's measurem ents are the same for bot h inequalities. 
It was shown in ( Scarani and Gisml l200lh that, for any 
3-qubit state shared by the parties, if {Bab) > 2 then 
(Bac) < 2. That is, if the statistics of Alice and Bob vio- 
late the CHSH inequality, then th e statistics of Alice and 
Char lie will not. More generally, ( Toner and Verstraetd . 
120061) showed that for an arbitrary quantum state shared 
by the three parties, we have that 



(Bab) 2 + (Bag) 2 < 



(77) 



Note again that if Alice and Bob violate their CHSH 
inequality, then Alice and Charlie do not. Moreover, if 
Alice and Bob observe maximal CHSH violation, we have 
from Q that (Bab) 2 = 8 and hence by (J77J the data of 
A and C are uncorrelated. Monogamy of correlations, 
however, is not specific to the CHSH inequality but ap- 
plies to essentially all bipartite Bell inequalities. In the 
lan guage of games (Section III. B.4l). this has been used 
in ijlto and Vidickl . l2012t iKempe et al\ . |2008() to " immu- 
nize" a nonlocal game against the use of entanglement. 

It is interesting to note t hat even no- si gnaling 
correlations are monogamous (IBarrett et al. I. l2005bl: 
lMasanes_e£a/ll2006t|Pawlowski and Bruknerl l2009h fsee 
Secti on lV.Aj) . In particular, it has been shown in ( Tonerl . 
l2009f ) that \(B A b) \ + \ (Bac)\ < 4, which is tight if we 
allow any no-signaling correlation. 

The fact that QKD protocols based on nonlocality can 
be proven secure (see Section IIV.CP can also be under- 
stood as a consequence of the monogamy of quantum 
correlations among Alice, Bob and Eve, and was indeed 
one of the factors motivating its study. 

Underlying the monogamy of correlations in the quan- 
tum s etting is an inherent monogamy of entangle- 
ment (jTerhall 120041) . Understanding the exact relation 
between both forms of monogamy is an interesting open 
problem. 



D. Nonlocality of multipartite quantum states 

1. Multipartite nonlocality vs multipartite entanglement 

In this section we discuss the relation between quan- 
tum nonlocality and entanglement in the multipartite 
setting. Similarly to the bipartite case, the two concepts 
are intimately related, although understanding precisely 
the link is a challenging problem. 

First, n ote that all pure entan g led n- partite states are 
nonlocal ( Popescu and Rohrlichl . Il992l ). That is, their 
measurement statistics cannot be decomposed in the 
form (|67|) . This follows from the fact that it is always 
possible for n — 2 parties to project (via a local projec- 
tion) the remaining two parties in a pure entangled state. 
Since the latter is nonlocal, the result follows. It should 
be stressed that this result does not rely any form of 
post-selection. 

In entanglement theory, a concept of particular im- 
portance is that of genuine multipartite entanglement. 
A quantum state features genuine multipartite entangle- 
ment when it cannot be decomposed as a convex combi- 
nation of biscparable states (states which are separable 
on at least one bipartition of the parties). Indeed, this no- 
tion is somehow analogous to that of genuine multipartite 
nonlocality, and it is not surprising that both are related. 
In particular, genuine multipartite quantum nonlocality 
can be obtained only if measurements on a genuine mul- 
tipartite entangled state are made. Thus, the presence 
of genuine multipartite nonlocality witnesses the pres- 
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ence of genuine multipartite entanglement. Importantly 
this is achieved in a device-independent way, that is, gen- 
uine multipartite entanglement is here certified without 
placing any assumptions about the devices used in the 
experiment, contrary to usual methods such as entan- 
glement witnesses and quantum tomography. Note that 
it is possible to design even better device-independent 
techniques for witnessing genuine multipartite, the vio- 
lation of which does not imply the pr e sence o f genuine 
multipartite n onloca lity dBancal et all l2011bf ) (see also 
(iNaeata et all . 120021: lUffinkL 12002^ . 

It is however not known whether all pure genuine mul- 
tipartite entangled state are genuine multiparti te nonlo- 
cal. It has been shown ( Almeida et al I l2010bh that all 
connected graph states are fully genuine nonlocal, in the 
no-signaling approach discussed in Scction rVI.A.31 More- 
over, it was also shown that the tangle, a specific measure 
of multipartite entanglement, is closely related to the vio- 
lation of Svetlichny 's inequality ( Aiov and Rungtall2010t 
lGh ose et al. I. I2009I) . In particular, from this connection it 
can be shown that there exist pure entangled states in the 
GHZ class which do not violate Svetlichny's inequality. 

Finally, it is worth noting that the connection between 
genuine multipartite entanglement and nonlocality may 
depend on which definition of genuine multipartite non- 
locality is used. Using the definition based on time- 
ordering (see (|7Tj) ). numerical evidence suggests that all 
pure genuine tri partite entangled sta tes are genuine tri- 
partite nonlocal ([Barrett et all l2~01ll ). 



2. Greenberger-Horne-Zeilinger states 

Greenbcrgcr-Hornc-Zcilingcr (GHZ) states are to- 
day arguably the most studied — and possibly the best 
understood — multipartite quantum states from the point 
of view of entanglement and nonlocality. GHZ states dis- 
play one of the most striking forms of nonlocality in the 
context of the Mermin-GHZ paradox (see Section flLDp . 
By performing local measurements on a tripartite GHZ 
state 



|GHZ) = -±=(|000) + |111)) 



(78) 



one obtains correlations which are maximally nonlocal, 
since the predictions of quantum mechanics are here in 
full contradiction with those of local models. Interest- 
ingly, it turns out however that these particular GHZ 
correl ations do not feature genuine multip artite nonlo- 
cality (|Cerecedal . 120021: iMitchell et al. \. \2Q04 . as they can 
be reproduced by a bi-separable model of the form (jS5|). 

It is nevertheless possible to generate genuine multi- 
partite nonlocal correla tions from local m easurements on 
a tripartite GHZ state ( Svetlichnvl . Il987f ). In particular, 
one can get violation of Svetlichny's inequality ((74"]) of 
53 = 4-\/2 > 4, which turns out to be the largest possibl e 
violation in quantum mechanics ( Mitchell et all l2004h . 
This violation can be intuitively understood by consider- 
ing again the form (|75[) of Svetlichny's inequality. Since 



it is Charlie's measurement setting that dictates which 
version of the CHSH game Alice and Bob are playing, 
the best strategy for C consists in remotely preparing 
(by performing a measurement on her qubit) a state for 
AB that is op timal for the violatio n of the corresponding 
CHSH game (|Bancal et aLl . l2011ah . 

The nonlocal correlations of generalized GHZ states, 
of the form 



GHZ 



N 



1 

72 



d-l 

£i;: 

3=0 



(79) 



featuring N parties and systems of local dimension 
d, have also been investigated. Fir st, analogues of 



the Mermin-GHZ paradox were fo und (|Cerf et al. I. l2002t 
IZukowski and KaszlikowskH 1 1999T) . Note however that 
examples of paradoxes are not known to exist for any 
combination of N and d; beyond the original example of 
GHZ (N = 3 and d = 2), the simpl e st kn own example 
features N = 5 and d = 4 (|Cerf et aUl2002f ). A Mermin- 
GHZ type paradox was als o presented for the case of 
continuous variable systems (Ivan Loock and Braunsteinl . 
120011 : iMassar and Pirorliol l200ll ). 

The genuine multipartite nonlocality of generalized 
GHZ states has also been investigated. First it was 
shown that all qubit GHZ states (i.e. |GHZ^)) violate 
the generalization (|76p of Svetlichny's inequality for an 
arbitrary number of parties, and hence display genuine 
multipartite nonlocal co r relati ons ( Collins et aLl . T2002bt 
ISeevinck and Svetlichnvl . |2002|) . Recently, it was shown 
that the correlations of any state of the form (|T9"|) are fully 
genuinely multiparti te nonlocal, as wel l as monogamous 
and locally random ( Aolita et al\ . l2012f ) . The robustness 
of GHZ nonlocality against local noise was investig ated 
in Refs. ( Chaves et all . 120121 : lLaskowski et al ]. l2010h . 



3. Graph states 



Graph states (|Hein et all l2004h form an important 
family of multipartite quantum states (including GHZ 
and cluster states) useful for applications in quantum in- 
formation science. In particular, all codeword states used 
in the standard quantum error correcting codes corre- 
spond to graph states, and one-way quantum computa- 
tion uses graph states as a resource. Here we will discuss 
the nonlocality of graph states (for GHZ states see Sec- 
tion EEZE2). 

Graph states are defined as follows. Let G be a graph 
featuring n vertices and a certain number of edges con- 
necting them. For each vertex i, we define neigh(i) as 
the neighborhood of i, which represents the set of ver- 
tices which arc connected to i by an edge. Next one 
associates to each vertex i a stabilizing operator 



9i = Xi 



Z 4 



(80) 



j'encigh(i) 



where Xi, Yi, and Zi represent the usual Pauli matrices 
a Xl <7 y , and a z applied to qubit i. The graph state \G) 
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associated to graph G is then the unique common eigen- 
vector to all stabilizing operators <?i, i.e. gi\G) = \G) 
for all i 6 {1, ...,n}. From a physical point of view, the 
graph G describes all the perfect correlations of the state, 
since (G\gi \G) = 1 for all i E {1, ...,n}. By considering 
the set of operators that can be obtained from products 
of stabilizer operators (|80[) . one obtains a commutative 
group featuring 2™ elements. This is the stabilizer group, 
defined as 



The state | CI4) is defined by the stabilizer relations 



S(G) = { Sj } j=1 



where Sj = 



(81) 



where Ij (G) denotes any of the 2™ subsets of the vertices 
of the graph G. 

Interestingly, this fundamental structure of graph 

states underpins a strong for m of nonlocality 

(|Guhne et all 120051; IScarani et al\ , 120051 ). It turns 



out that all graph states feature nonlocal correlations 
( Guhne et Ml, l2005|> . In order to prove this, the main 



idea consists to construct Bell inequalities by adding all 
elements of the stabilizer group S(G). Thus we consider 
the operator 



(82) 



i=i j=i 



where operators O* € {1, Xj, Yj, Zj} are from the Pauli 
basis. 

It is then possible to define a Bell inequality based on 
the above Bell operator, and to compute its local bound 



L(G) = max I (6) I 

LHV 



(83) 



While the graph state \G) reaches the value of 2 n for such 
Bell inequality (indeed s t \G) = \G) for all i e {1, 2"}), 
it turns out that L{G) < 2 n for any graph G. Thus, 
for all graph states it is possible to construct a Bell in- 
equality, which the state then violates maximally. Indeed 
this demonstrates that nonlocality is a generic feature of 
all graph states. Moreover, for certain families of graph 
states, basically states based on tree graphs (featuring no 
closed loops), the violation of the Bell ine quality grows 
expo nentially with the number of vertices (|Guhne et all 
120051) . 

While the generality of the above approach is re- 
markable, it is possible for certain important classes 
of graph states, in particular for cluster states, to de - 
rive stronger proofs of nonlocality ( Scarani et all [20051) . 
Cluster states form a subclass of graph states based on 
square lattice graphs. For simplicity and clarity we will 
discuss here the case of a four qubit cluster state on a 
one-dimensional lattice 15 , which is locally equivalent to 

|C1 4 ) = -(|0000) + 10011) + |1100) - |1111)). (84) 



X z 1 1 

Z X z 1 

1 Z X z 

1 1 Z X 



1 (£71) 

1 (£72) 

1 (£73) 

1 (£74) 



(85) 



By multiplying certain of these four relations, we get 



(£71) x (£73) 

(£72) x (£73) 

(£71) x (£73) x (£74) 

(£72) x (£73) x (£74) 



X 1 X z 

Z Y Y Z 

X 1 Y Y 

Z Y X Y 



-- 1 
= 1 
= 1 
-1 



(86) 



Note that here we have used the Pauli algebra, which ex- 
plains the emergence of a minus sign in the last relation 
above. It can be readily checked, that for any deter- 
ministic local model, i.e. attributing ±1 values to each 
measurement (X, Y, Z), it is impossible to satisfy simul- 
taneously all four relations above; at least one of them 
will not hold. Check for instance that by simply multi- 
plying (using standard multiplication) the first three rela- 
tions in (|86p . one obtains the fourth relation. Therefore, 
we obtain a perfect contradiction betwee n quantum and 
classical predictions ( Scarani et aUl2~005l ). in the spirit of 
the GHZ paradox. 

Similarly to the Mcrmin-GHZ case (see Section Hl.DI) . 
this logical contradiction can be rephrased as a Bell in- 
equality. By considering the four relations the 
structure of this Bell inequality was simply deduced to 
be: 

\aia' 3 a4 + aia^a^ + 0^020304 — a'^a^a^l < 2 (87) 

Notice that by grouping the first two parties one obtains 
the Mermin inequality (|38j). Performing measurements 
on the state ICI4), the algebraic maximum of 4 can be 
obtained for the left hand side of (|87|) . Finally note that 
an interesting feature of the Bell inequality (|87|l is that 
it cannot be violated by the four-qubit GHZ state. Thus 
the inequality is a strong entanglement witness 16 for the 
cluster I CI4) . The above construction can be generalized 
to cluster states of an arbitrary number of qubits and of 
arbitrary loc al dimension, as well as to certain classes of 
graph states ( Scarani et all 120051 ) . 

An interesting issue is to understand whether there 
exists a link between the nonlocality of cluster states and 
the computational power they offer. Although such a 
connection has not been clearly esta blished yet, progress 
has been made ( Hoban et all I20T1I) . Another important 



class of graph states is codeword states, the nonlocalit; 



of wh ich has been discussed in (IDiVincenzo and Pere; 
119971) . 



15 Note that for two and three qubits, the ID cluster state is equiv- 
alent to a Bell state and to a GHZ state, respectively. 



Notice however that, as written, the inequality l|87| l can also 
be maximally violated by a three-partite entangled state, since 
party 2 has only one setting. This deficiency can be overcome 
by symmetrizing the inequality over the parties. 
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Finally, it is important to note that all nonlocality 
proofs discussed in this subsection concern Bell nonlo- 
cality. Unfortunately much less is known concerning the 
genuine multipartite nonlocality of graph states. It is 
however known that all states based on connected graphs 
(graphs in which any two vertices are connected, al- 
though not necessarily in a direc t manner) display full y 
genuine multipartite nonlocality ([Almeida et all 12010 a) . 



4. Nonlocality of other multipartite quantum states 

In the multipartite case, entanglement displays a rich 
structure. There exist many inequivalent classes of mul- 
tipartite entanglement. Although we know that all mul- 
tipartite entangled pure states are nonlocal, very few is 
known beyond the case of graph states. 

An important class of multipartite entangled states are 
Dicke states, that is, states with a fixed number of exci- 
tations and symmetric under permutation of the parties, 
which are ce ntral in the c ontext of the interaction of light 
and matter ( Picket fl954l) . The symmetric state of n par- 
ticles with a single excitation, known as the W state, 
reads 



|W„> = -^(|0...01) 



|10...0» 



Such states are relevant to the description of various 
physical systems, such as quantum memories. One pos- 
sibility for detecting the nonlocality of W states consists 
in having n — 2 parties performing a measurement in the 
logical basis {|0),|1)}. When all project onto the |0) 
eigenstate, which happens with a fairly large probability 
(increasing with n), they prepare for the remaining two 
parties a (two qubit) Bell state, on whic h the CHSH in- 
equa lity can then be tested and violated ( Sen(De) et all 
120031 ). Another manifestation of the nonlocality of the 
Dicke states is based on their robustness to losses. In- 
deed when k <C n particles are lost, the state remains 
basically unchanged. For instance, for W states one has 
that tr fc (|W„> (W„|) « \W n - k ) {W n - k \, where tr* de- 
notes the partial trace on the k particles which have been 
lost. Th e W state thus has a high " persistency" of non- 
locality ( Brunner and Vertesil . I2012T) . in the sense that a 
large number of particles must be lost in order to destroy 
all nonlocal correlations. This appears to be a generic 
feature of Dicke states. 

Another relevant problem is whether one can distin- 
guish different classes of multipartite entangled states 
via their nonlocal correlations. This can be done us- 



ing judiciously designed Be ll inequalities (jBrunner et all 
120121 ; ISchmidt et al. I. I2008h . For instance, the resistance 
to losses of W states can be exploited to distinguish their 
nonlocal correlations from those of GHZ states. 

The nonlocal properties of more general classes 
of states have been discussed. First, the nonlo- 
cality of symmetric qubit states was investigated in 
( Wang and Markhaml . 120121 ) . Exploiting the Majorana 



representation, the authors could derive Hardy-type non- 
locality proofs (see Section Hl.Ep for arbitrary symmetric 
pure entangled states. Also, the resistance to noise (i.e. 
determining the maximal amount of white noise that can 
be added to a state such that it remains nonlocal) has 
been evaluated num erically for a l arge c lass of multipar- 
tite quantum states ( Gruca et all\20ldt) . 

Finally, the relation between entanglement distillabil- 
ity and nonlocality was also i nvestigated in the multi- 
parti t e case. It w as shown in ( Augusiak and Horodeckil . 
120061 ; iDiirl . 120011) that an multi-qubit bound entangled 
state can violate the Mermin inequalities. However, the 
states considered in these works become distillable when 
sever al parties can group. In fact, it was shown in ( Acini . 
I2001D that the violation of the Mermin inequalities im- 
plies that distillibality between group s of parties. More 
recently, ( Vertesi and Brunnerl l2012t) presented an ex- 



ample of a fully bound entangled state (for which no 
entanglement can be distilled even parties are allowed 
to group) which violates a Bell inequality. This shows 
that nonlocality does not imply the presence of distill- 
able entanglement, and refutes the Peres conjecture in 
the multipartite case (see Section lill. A. 5|) . 



VII. EXPERIMENTAL ASPECTS 

Violations of Bell inequalities have been observed ex- 
perimentally in a variety of physical systems, giving 
strong evidence that nature is non-local. Nevertheless, 
all experiments suffer from various loopholes, opened by 
technical imperfections, which make it in principle possi- 
ble for a local model to reproduce the experimental data, 
even if in a highly contrived way. In recent years, an 
intense research effort has been devoted to the design 
and realization of a loophole-free Bell experiment, which 
should be within experimental reach in the near future. 
Besides its fundamental interest, closing some of these 
loopholes (in particular the detection loophole) is impor- 
tant from the perspective of practical applications of non- 
locality such as device- independent quantum information 
processing. Indeed, while the idea that Nature is ex- 
ploiting such loopholes to fake non-local correlations may 
sound conspiratorial, the perspective is entirely different 
when we consider the possibility that they are exploited 
by an adversary to break a cryptography protocol. 

In this section we review the main achievements and 
challenges in this area. For a more exhaustive discussion 
on Bell experiments, wc refer the reader to recent reviews 



(jGenovesel 120051 ; iPan et aUl20"l2l ). 



A. Bell experiments 

1. Photons 

Tremendous experimental progress in quantum optics 
during the 1960s opened the door to possible tests of 
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quantum nonlocality in the laboratory. First, using 
atomic cascades, it became possible to create pairs of 
photons entangled in polarization. Second, the polariza- 
tion of single photons could be measured using polar- 
izers and photomul tipliers. Only three years after the 
proposal of CHSH (IClauser et all Il969l) .' Freedman and 
Clauser ( Freedman and Claused . 1 19721) reported the first 
conclusive test of quantum nonlocality, demonstrating a 
violation of the CHSH Bell inequality by 6 standard de- 
viations. 

During th e following years, other exper i ments 



([Aspect et all Il98ll Il982bl; iFrv and Thompson! . Il976l ) 
were performed, giving further confirmation of the pre- 
dictions of quantum mechanics. However, the main draw- 
back of all these experiments was that they were per- 
formed with static setups in which the polarization ana- 
lyzers were held fixed, so that all four correlations terms 
had to be estimated one after the other in four successive 
runs. In principle, the detector on one side was thus fully 
aware of the measurement setting chosen on the other 
side, and furthermore one could not reasonably consider 
that the measurement settings were independent of the 
particles behavior, thus leaving serious loopholes open 17 
(see subsection IVH.B~2j) . 

Crucial progress came in 19 8 2, whe n Aspect, Dal- 
ibard and Roger ( Aspect et all Il982al) performed the 
first Bell experiment with time-varying polarization an- 
alyzers. The settings where changed during the flight of 
the particle and the change of orientation on one side 
and the detection event on the other side were separated 
by a space-like interval, thus closing the locality loophole 
(see section lVII.B.2[) . It should be noted though that the 
choice of measurement settings was based on acousto- 
optical switches, and thus governed by a quasi-periodic 
process rather than a truly random one. Nevertheless the 
two switches on the two sides were driven by different 
generators at different frequencies and it is very natural 
to assume that their functioning was uncorrelated. The 
experimental data turned out to be in excellent agree- 
ment with quantum predictions and lead to a violation 
of the CHSH inequality by 5 standard deviations. 

The advent of quantum information in the 1990s trig- 
gered renewed interest in experimental tests of quan- 
tum nonlocality. In 1998, violation of Bell inequali- 
ties with photons separated by more than 10km was 
reported (|Tittel et all Il998l) . The same year, another 



experiment demonstrated violation of Bell inequalities 
with the locality loophole closed and using a quan- 
tum random num ber generator to g enerate the mea- 
surement settings ( Weihs et aZ.I . I1998I) . In turn, both of 



these experiments were adapted to implement quantum 



17 By performing Bell tests with all correlation terms measured suc- 
cessively with the settings held fixed, it is not unusual to observe 
experimentally, because of slow drifts in the set-up, apparent 
violations of Bell inequalities above Tsirel son's bound or even 
violation of the no-signaling conditions (0 (Afzclius, 2011). 



key distribution based on nonlocal quantu m correlations 
dJennewein et aZ l l2000tlTittel et aLl . [2000h . following Ek- 
ert's idea rtEkertl Il99ll) ■ 



Demonstrations of quantum nonlocality in photonic 
systems have been reported using various types of en- 
coding apart from polarization. First, Bell inequality vi- 
olations based on phase and momentum of photons have 
been achieved ( Rarity and Tapsterl . ll990T) . In 1989, Fran- 
son proposed a test of quantum nonl ocality based o n 
the energy-time uncertainty principle ( Fransoil Il989l) . 
This encoding, use d for instance in the experiment of 



dTittel et all 119981). lead t o the concept of time-bin en- 
coding (|Tittel et all [l999h which turned out to be par- 
ticularly well suited for the distribution of entanglement 
on long distances. Bell inequality violation have also 
been demonstrated using photons ent angled in orbital 
angular momentum ( Mair et al I 1200 ID . An important 
advantage of both time-bin and orbital angular momen- 
tum encodings, is that they allow for the realization of 
higher dimensional quantum systems, whereas polariza- 
tion is limited to qubits. Nonlocal co rrelations of qutrit s 
have been reported with time-bins ((Th ew et al. . 2004). 
while Bell violation with orbital angular momentum have 
recentl y been reported u sing systems of dimensions up 
to 11 ( Dada et all l201lh . Another possibility to create 
higher dimensional entanglement consists in generating 
pairs of photons entangled in several d egrees of free - 
dom, so called hyper-entangled photons ( Kwiatl Il997h . 
Bell experiments have been performed with s uch systems 
( Barreiro et al 1 120051: ICeccarelli et aZ.I . l2009h . combining 
polarization, spatial mode and energy-time degrees of 
freedom. Finally, the phenomenon of hidden nonlocal- 
ity (see section III.A .3) was also experimentally reported 
(|Kwiat et qZ.I . l200lh . 

Multipartite quantum nonlocality has also been 
demonstrated experimentally. Bell inequality viola- 
tions were achieved wi th three photons, generati ng both 
GHZ (|Pan et all l2000h and W (lEibl et all\2004 s tates 



and with f our-ph oton GHZ states (lEibl et all 12005 ; 
Zhao et al. I, l2003h and Cluster state (jWalther et al 



20051 ). Genuine multipartite nonloc ality of three-photo n 
GHZ states was also demonstrated ( Lavoie et all 120091) . 



Note also 
can even be 



that nowadays Bell experiments 
envisaged for pedagogical purposes 



ge d to r 

(|Dehlinger and Mitchell 120021) . In partic ular, rea dy- 
to-use setups are available commercially ( Qutoolsl Q), 



which ar e fully operational, even from the perspective of 
research ( Pomarico et all 1201 lal) . 

Finally, it is important to keep in mind that all the Bell 
experiments discussed above are plagued by the detec- 
tion loophole (see subsection IVII.B.1|) . This is because 
the photon detection efficiency in these experiments is 
low (typically 10 — 20%) which makes it possible, in prin- 
ciple, for a local model to reproduce the raw data. It 
is only under the assumption that the probability of de- 
tecting or non-detecting a photon is independent of the 
choice of measurement (the so-called 'fair-sampling' as- 
sumption, allowing one to discard inconclusive events) 
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that the experimental data leads to Bell inequality viola- 
tions. Recently though an experimental violation of Bell 
inequalitie s with the det e ction loophole closed was re- 
ported in ( Giustina et «m2012f) . This experiment, how- 
ever, is a table-top experiment using a static setup with 
measurement settings held fixed and thus did not close 
the locality loophole. 



2. Atoms 

Besides photons, Bell experiments have also been con- 
ducted with atomic systems. Such systems offer an im- 
portant advantage from the point of view of the detec- 
tion, with efficiencies typically close to unity. Therefore, 
atomic systems are well-adapted for performing Bell ex- 
periments free of the detec tion loophole. Suc h an exper- 
iment was first realized in ( Rowe et~al\ , l200ll) , using two 
Be+ ions in a magnetic trap. In this experiment, the two 
ions were placed in the same trap, separated only by 3 
/im. The locality loophole was thus left wide open, since 
each ion can feel the light field aimed at measuring the 
state of the other ion. 

More recently, quantum nonlocality was demonstrated 
between two Yb+ ions sitting in sep arated traps, one 



between two Y p+ ions sitting m sep arated traps, one 
meter apart ( Matsukevich et all , 120081 ) . This was further 
i mproved to a distanc e of 20m using Rubidium atoms 
( Hofmann et all l2012t ). Although this distance is still 
insufficient to close the locality loophole — a distance of 
300m is required using the fastest procedure to mea- 
sure the atomic state of the atoms — the crosstalk be- 
tween the two atoms is now completely suppressed. Here 
the entanglement between the distant atoms is achieve d 
using an 'event- ready' scheme ( Simon and Irvinel |2003|) . 
sketc hed in Fig. [71 wh i ch is b ased on entanglement swap- 
ping (|Zukowski7i all Il993l ). First each atom is trans- 
ferred to an excited state. The ion desexcites by emitting 
a photon. The structure of the atomic levels is chosen 



such that the polarization of the emitted photon is maxi- 
mally entangled with the state of the atom. The emitted 
photons are then collected in single mode optical fibers. 
Finally a partial Bell state measurement is performed on 
the two photons, using a simple beam-splitter followed by 
single photon detectors. A coincidence detection of two 
photons at the detectors indicates that the photon were 
in a given Bell state. In this case entanglement swap- 
ping is achieved, that is, the initial atom-photon entan- 
glement has been converted to atom-atom entanglement. 
Upon successful detection of the photons, local measure- 
ments are performed on the atoms. The procedure is 
repeated until enough data has been taken in order to 
obtain good statistics. Important advantages of such an 
'event ready' experiment is its robustne ss to photon losses 
and t o the coincidence-time loophole ( Larsson and Gill 
I2004J ) . Recently, such an experiment was used to conduct 
a proof-of-principle dem onstration of device-in dependent 
randomness expansion (jPironio et all l2010al ) (see Sec- 
tion 



3. Hybrid schemes and other systems 

Finally, let us mention that Bell inequality viola- 
tions have also been report e d usi ng atom-photon en- 
tanglement ( Moehring et all |2004[ ). and entanglement 
between a photon and a collective atomic excitation 
( Matsukevich et "oZI. l2005h . Nonlocality was also demon- 
strated in Josephson phase superconducting qubits. In 
par ticular, violation of the CHSH inequality was achieved 
in ( Ansmann et all l2009f). whereas the GHZ paradox 
was d emonstrated in ( DiCarlo et "oil. 120101 : iNeelev et al . 
l2010h . 



B. Loopholes 

1. Detection Loophole 



V 




FIG. 7 Bell test based on distant entangled atoms. Each atom 
is entangled with an emitted photon. Upon successful projec- 
tion of the two photons onto a Bell state, the two atoms be- 
come entangled. The scheme is therefore 'event-ready', which 
makes it robust to photon losses in the channel. Moreover, 
since atomic measurements have an efficiency close to one, 
this scheme is free of the detection loophole. This setup has 
been implemente d experimentally with a distance of 20m be- 
tween the atoms (|Hofmann et aZ.L I2012T) . and used for dev ice- 
independent randomness expansion (jPironio et all l2010al h 



In a large class of Bell experiments, in particular all 
those carried out with photons, measurements do not al- 
ways yield conclusive outcomes. This is due either to 
losses between the source of particles and the detectors 
or to the fact that the detectors themselves have non-unit 
efficiency. A measurement apparatus, used e.g. to test 
the CHSH inequality, has then three outcomes instead of 
two: it can as usual give the outcomes —1 or +1, or it 
can give a 'no-click' outcome, denoted U. The simplest 
way to deal with such 'anomalous' data is simply to dis- 
card them and evaluate the Bell expression on the subset 
of 'valid' ±1 m easu rement outcomes. As poin ted out by 
(|PearleL Il970( ) and (jClauser and Hornd . 1 19741 ) . this way 
of analyzing the results is only consistent under the as- 
sumption that the set of detected events is a fair sample, 
i.e., that the accepted data is representative of the data 
that would have been recorded if the detectors had unit 
efficiency. More generally, one can consider local models 
where this fair-sampling assumption fails and in which 
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the probability to obtain a 'no -click' outcome _L depends 
on the choice of measurement ( Clauser and Horne1 . ll974 



iPearleL 1 19701 : ISantosl Il992h . If the detection efficiency 
is too low (below a certain threshold), such local mod- 
els can completely reproduce the observed data, opening 
the so-called detection loophole. The threshold efficiency 
required to close this detection loophole is typically high 
for practical Bell tests. As a consequence, most optical 
realization of Bell tests performed so far are plagued by 
the detection loophole. 



a. Faking Bell inequality violations with post-selection. 
Throwing away 'no-click' outcomes and keeping only the 
valid outcomes ±1 is an example of post-selection. In 
general, allowing for post-selection in a given theory, al- 
lows one to achieve tasks which would be impossible with- 
out it. In particular, post-selection makes it possible to 
fake the violation of a Bell inequality, even in a purely 
local theory. 

To illustrate this idea, we will see how it is possi- 
ble for a local model to fake maximal violation of the 
CHSH inequality. In particular, we will show how to 
generate Popescu-Rohrlich correlations, a b = xy, 
where x, y,a,b € {0, 1} (see Section lTl.C.2p . starting from 
shared randomness, and allowing the detectors on Al- 
ice's side to produce a no-click outcome _L. The model is 
the following. The shared randomness correspond to two 
uniform random bits x gucss and a. Given measurement 
setting y, Bob's detector outputs b = a x gucas y. Alice's 
detectors outputs a whenever her measurement setting is 
x = .T gucss , and outputs _L when x ^ x gucss . Focusing on 
the valid outcomes, Alice and Bob have achieved maxi- 
mally nonlocal PR correlations, i.e. achieving a CHSH 
value of S = 4. The probability for Alice to obtain a 
conclusive outcome is 1/2, which is the probability that 



Xgucss J 



while Bob always obtains a conclusive out- 
come. With additional shared randomness, it is possi- 
ble to symmetrize the above model, such that Alice and 
Bob' s detection probability is 2/3 (jMassar and Pironiol 
120031) . Therefore, if the detection efficiency in a CHSH 
Bell experiment is below 2/3, no genuine Bell inequality 
violation can be obtained, since the above local strat- 
egy could have been used by the measurement appara- 
tuses. More generally, the minimum detection efficiency 
required for successfully violating a given Bell inequal- 
ity depends on the number of parties and measurements 
involved (see Section IVH.B. 1 .b[) . 

Interestingly, recent experiments demonstrated fake 
violations of Bell in equalities using classical optics 
( Gerhardt et all 1201 ll ). positiv e Wigner function states 



and quadrature measurements ([Tasca et al 
a classical amplification scheme (jPomarico et all 
These experiments are performed under the same con- 
ditions as standard Bell experiments, but exploit side- 
channels. This nicely illustrates the importance of clos- 
ing the detection loophole in Bell tests, in particular for 
the perspective of implementing device-independent pro- 



tocols. 



b. Taking into account no-click events. The previous dis- 
cussion shows that in order to close the detection loop- 
hole no-click outcomes cannot be discarded without mak- 
ing further assumptions. The most general way to take 
no-clicks events into account is simply to treat them as 
an additional outcome and instead of a d-outcome Bell 
inequality (if the number of 'valid' outcomes is d) use a 
rf+l-outcomc Bell inequality. A possible way to obtain an 
effective d + 1-outcome Bell inequality from a d-outcomc 
one is simply to merge the no-click outcome with one of 
the valid outcomes 18 , i.e., systematically assign one of 
the valid outcomes to the no- click events. In particular , 
the Clauser- Home inequality ( Clauser and HorneIll974l) . 
which is often used in Bell tests with inefficient detectors, 
is nothing but the CHSH inequality where the —1 out- 
come and the no-click outcome _L have been merged into 
one effective '— 1' outcome. 

Assigning one of the valid outcomes to the no-click 
outcome _L is often the optimal way to treat no-click 
events, although there is no general proof of this and 
a counter-example exists for no-signaling correlations 
(IWilms et all I2008D . [n the case where d detectors are 



used to register the d outcomes of a measurement, as- 
signing one of the valid outcomes to the no-click events 
has also the technical advantage that the detector associ- 
ated to that particular outcome is no longer needed, i.e. 
only d — 1 detectors arc sufficient since no distinction is 
being made between obtaining the d-th outcome and not 
detecting anything. 



c. Threshold efficiencies. When treating the no-click out- 
come as described above, one will generally find that a 
Bell violation is obtained only if the detector efficiencies 
are above a certain threshold. The minimal threshold 
efficiency -q* required to close the detection loophole, de- 
pends generally on the number of parties, measurements 
and outcomes involved in the Bell test. Moreover, rf may 
also vary depending on the exact set of correlations that 
is considered. Thus, in quantum Bell tests, rf may also 
depend on which entangled state and which measurement 
settings are considered. Below we review the efficiency 
thresholds for the most important Bell inequalities and 
for the most common quantum entangled states. 

We start by deriving rj* for the CHSH Bell inequality 
using a two-qubit maximally entangled state. Performing 
judicious local measurement on this state, one obtains a 



18 Inequalities obtained in this way arc liftings of the original 
inequality (Pironio, 2005). If the original inequality is facet- 
defining for the d-outcome Bell polytope, then the lifted inequal- 
ity is facet-defining for the d + 1-outcomc polytope. However, 
the d + 1-outcome Bell polytope has also in general facets that 
cannot be viewed as liftings of d-outcome inequalities. 
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CHSH value of S — 2\[2 (the maximum possible value in 
quantum mechanics). Now, let us assume that Alice and 
Bob have imperfect detectors with efficiency r\ and that 
when a no-click results _L is obtained, they assign to it the 
+1 outcome. When both detectors click, which happens 
with probability rj 2 , Alice and Bob achieve S = 2\[2. 
When only one detector clicks, the outcomes are com- 
pletely uncorrelated leading to S — 0. Finally, when no 
detectors click, which happens with probability (f — rj) 
Alice and Bob both always output +1, thus achieving 
the local bound S = 2. In order to close the detection 
loophole, we must ensure that the entire data of the ex- 
periment violates the CHSH inequality, i.e. that 



r) 2 2V2 + (I -r,) 2 2 > 2 



This leads to the condition that 

2 



rj > rj 



1 + V2 



82. 



(89) 



(90) 



where r/* denotes the threshold efficiency Therefore, in 
order to get a detection loophole free CHSH violation 
with a two-qubit maximally entangled state, it is suffi- 
cient to have a detection efficiency larger than ~ 82.8%. 
This efficiency is also necessary: an explicit local model 
can be built which reproduces the experimental data 
when -q < 2/(1 + s/2). 

Remarkably, it turns out that this threshold efficiency 
can be lowered by considering partially entangled states 
(of the form = cos 9 |00) + sin 9 | 11)) as discovered by 
( Eberhardl . ll993l ). In particular, in the limit of a product 
state (i.e. 9 — > 0) one obtains that r/* — > 2/3. This as- 
tonishing result was the first demonstration that some- 
times less entanglement leads to more nonlocality (see 
Section MAJ}- 

Beyond the C HSH case, discussed in detail in 
dBranciardl l201ll) . it is known that lower threshold ef- 
ficiencies can be tolerated for Bell inequalities featur- 
ing more measurement settings. A lower bound for the 
threshold efficiency is given by 



1* > 



m,A + ra B - 2 

TflATtl B — 1 



where wia (tub) denotes the num ber of settings of Alice 
(Bob) ([Massar and Pironiol , l2003t ). While it is not known 
whether this bound can be obtained in general with quan- 
tum correlations, improvements over the threshold effi- 
ciencies of the CHSH inequalities have been obtained by 
considering Bell scenarios with more measurement set- 
tings. For qubit states on l y min o r improvemen ts were 
found dBrunner and Gisirl 120081: iMassar et al\ , l2002t 
iPal and Vertesil . l2009h . However, Massar showed that, 
when considering systems of higher Hilbert space dimen- 
sion d, the threshold efficiency can become arbitrarily 
close to zero. Unfortunately, this result is of limited 
practical interest since improvements over the CHSH case 
are only obtained for systems with d > 1600. More re- 
cently a Bell test involving entangled quqats (d = 4) and 



four (binary) measurement settings was sh own to tolerate 
detec tion efficiencies as low as ~ 61.8% ( Vertesi et all 
2010). Such a scheme could be implemented optically 
using hyper-entanglement. 

Threshold efficiencies have also been derived for certain 
multipartite Bell tests (with n parties) , using qubit GHZ 
state s. Based on a combinatorial study, (jBuhrman et all 
l2003h showed that an arbitrarily small efficiency can be 
tolerated as n becomes large. Threshold efficiencies ap- 
proaching 50% in the limit of a la rge n where demon - 
strated for the Mermin inequalities ( Cabello et all 120081 ) 
and for a multipartite generaliza tion of the CH inequality 
( Larsson and Semitecolosl l200lh . More recently, multi- 
partite Bell tests tolera ting efficiencies s ignificantly below 
50% were reported in (Pal et aUl2012t) . 

Finally, detection efficiencies have also been consid- 
ered in asymmetric Bell experiments. Consider first the 
case in which Alice and Bob feature different detection 
efficiencies (tja and rjs respectively). In particular re- 
sults have been obtained for the case where t\a < 1 
and r/B = 1, which is relevant for Bell tests based on 
atom-photon entanglement (Alice holds the photon and 
Bob the atom). It has been shown that for the CHSH 
inequality th e efficiency of Alice's detector can be low- 
ered to 50% dBrunner et all l2007t ICabello and Larssonl 
120071 ). Moreover, this efficiency can be further lowered 
to ~ 43% by cons i derin g a three-setting Bell inequal- 
ity ( Brunner et all 120071) . Considering Bell tests with 
d measurement settings and d-dimcnsional systems, an 
effici ency as low a s 1/d can be tolerated, which is opti- 
mal (Vertesi et aZ.I , l2010l ). Another asymmetric scenario 
consists on the case where the local measurements have 
different efficiencies. Let us call r/A and r/Ai the effi- 
ciency of Alice's observables and r/B and r/B 1 the effi- 
ciency of Bob's. It has been shown that, in the CHSH 
scenario, if one of the measurements of each party has 
unit efficiency (say r)A = T]b = 1) then an arbitrarily 
low efficiency can b e tolerated for the other measurement, 
i.e.rjAi = "f]Bi 1 Garbarinol l2010h . 



(91) 2. Locality loophole 



The locality condition (3]) is motivated by the absence 
of communication between the two measurement sites of 
a Bell experiment. This seems well justified if the two 
sites are sufficiently separated so that the measurement 
duration is shorter than what it would take for any light- 
speed signal to travel from one site to the other. If this 
condition is not satisfied, one could in principle conceive 
a purely "local" mechanism - i.e., involving slower-than- 
light speed signals - underl ying the observed correlations 
(lAspectl . 119761 : [Bell . Il977ah . 

In addition to the requirement that the two measure- 
ment sites be space-like separated, it must also be the 
case that the measurement setting on one side is not de- 
termined by an earlier event that could be correlated with 
the measurement setting on the other side; in particular 
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it should not be correlated with the hidden variables A 
characterizing the source of particles. That is, the mea- 
surement settings must correspond to "random" or "free" 
choices, which are independent from the ot her s i de and 
from the hidden stat e of the particle pairs (iBeli Il977ri 
IShimonv et ailll97fift . 

Mathematically, the above requirements correspond to 
the following conditions 

P(a\x, y, b, A) = p(a\x, A) , p(b\y, x, a, A) = p(b\y, A) , 

(92) 

and 



q(X\xy) = q(X) 



(93) 



from which the locality decomposition © follows. Fail- 
ure to satisfy them is known as the locality loophole. 
The failure to address specifically the independence con- 
dition (|93[) between measurement choices and hidden 
variables is so metim es called the "freedom-of-choice" 
2010ft or "measurement-indepe ndence" 



(Scheidl et all 



loophole ([Branciard and Gisinl. 1 2011 
The experiment of ( Aspect et all 



Hal 12010ft. 



1982a) using entan- 



gled photons was the first to address convincingly the 
locality loophole. It involved on each side a switching de- 
vice for the incoming photons followed by two polarizers 
aligned along different orientations. A change of direc- 
tion occurred approximately every 10 ns. The distance 
L = 13 m between the two switches was large enough 
that the time of travel of a signal between the switches 
at light speed L/c = 43 ns was larger than the delay 
of 10 ns between two switchings. The switching of the 
polarizers was done through a home built device, based 
on the acousto-optical interaction of the incoming light 
with an ultrasonic standing wave in water. Using this 
mechanism it should be noted that not all photons were 
submitted to forced switching. In addition, the switches 
were not truly random, since the acousto-optical were 
driven by periodic generators. The two generators on 
the two sides, however, were functioning in a completely 
uncorrelated way, since they were operated by different 
RF generators at different frequencies with uncorrelated 
frequency drifts. 

The experiment of ( Aspect et all Il982al) was the only 
one involving fas t changes of t he m easurement settings 
until the one of (jWeihs et all Il998ft . which used high- 
speed electro-optic modulators to switch between two po- 
larization measurement settings on each side. The two 
modulators where controlled on a nanosecond timescale 
by two independent quantum random number genera- 
tors, excluding any light-speed influence between the two 
sides, which were separated by a distance of a few hun- 
dred meters. Leaving aside the possibility that the out- 
puts of the quantum random number generators are pre- 
determined at some hidden level, this setup is often re- 
garded a s having conc l usivel y closed the locality loop- 
hole. In (Sc heidl et al 1 12010ft . space-like separation was 
not only enforced between the outputs of the two ran- 
dom generators, but also between them and the emission 



of photons from the laser source generating the entan- 
gled particles, implying that these three processes are 
independent from each other, provided that they are not 
themselves determined by some earlier events. 

At this point, it should be stressed that, contrarily to 
the detection loophole, the locality loophole can never be 
"completely" closed. Strictly speaking, it requires space- 
like separation between the event determining the choice 
of measurement setting on one side and the event corre- 
sponding to the output of the measurement device on the 
other side. The problem is that this requires to know pre- 
cisely the time at which these two events happen. But if 
we use some random process that outputs a random value 
at time, say t = 0, how do we know that this value, even 
if it is independent from the the hidden variable and from 
the mechanism determining the measurement setting on 
the other side, was precisely generated by the random 
process at this time and not at some earlier time t = —51 
Similarly, how do we know precisely when a measure- 
ment is completed without maki ng some assu mptions on 
the collapse of the wav efunction (|Kentl. 2005ft ? This last 
issue was addressed in ( Salart et all 20081 ) where the vi- 
olation of Bell inequalities for events that are space-like 
separated according to a simple model of gravitational 
collapse has been reported. 

A second problem is that we may never be sure that the 
choices of measurements are really "random" or "free" . 
For instance , in t he experiments ( Scheidl et all l2010l 
IWeihs et al. I. il998ft the measurement choices are decided 
by processes that are genuinely random according to 
standard quantum theory But this need not be the case 
according to some deeper theory. Some people have ar- 
gued that a better experiment for closing the locality 
loophole would be to arrange the choice of measurement 
setting to be determined directly by humans or by pho- 
tons arriving from distant galaxies from opposite direc- 
tions, in which case any local explanat ion would involv e 
a conspiracy on the intergalactic scale ( Vaidmar] . l200ll ) . 



The point of this discussion is that an experiment 
"closing" the locality loophole should be designed in such 
a way that any theory salvaging locality by exploiting 
weaknesses of the above type should be sufficiently con- 
spiratorial and contrived that it reasonably not worth 
considering it. 

Finally, it is worth stressing that in device- independent 
applications of quantum nonlocality, the experimental re- 
quirements for satisfying conditions (|§2"T) and (|M|) are sen- 
sibly different than in fundamental tests of nonlocality, 
since one usually assumes the validity of quantum theory, 
that the quantum systems are confined in well defined 
measurement devices that can be shielded from the out- 
side world, that the inputs are under the control of the 
users, and so on. Th is stance was used for instance in 
( Pir onio et al 1 l2010aft . where the atoms were separated 
from each other, though by no means fulfilling any of the 
space-like separation prescriptions required for a funda- 
mental, locality loophole- free Bell test. 
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3. Finite statistics 

Since it is expressed in term of the probabilities for the 
possible measurement outcomes in an experiment, a Bell 
inequality is formally a constraint on the expected or av- 
erage behavior of a local model. In an actual experimen- 
tal test, however, the Bell expression is only estimated 
from a finite set of data and one must take into account 
the possibility of statistical deviations from the average 
behavior. The conclusion that Bell locality is violated 
is thus necessarily a statistical one. In most experimen- 
tal papers reporting Bell violations, the statistical rele- 
vance of the observed violation is expressed in term of the 
number of standard deviations separating the estimated 
violation from its local bound. Their are several prob- 
lems with this analysis, however. First it lacks a clear 
operational significance. Second, it implicitly assumes 
some underlying Gaussian distribution for the measured 
systems, which is only justified if the number of trials 
approaches infinity. It also relies on the assumption that 
the random process associated to the k-th trial is inde- 
pendent from the chosen settings and observed outcomes 
of the previous k — 1 trials. In other words, the devices 
are assumed to have no memory, which is a questionable 
assumption ( Accardi and Regolil . l2000h . 

A better measure of the strength of the evidence 
against local models is given by the probability with 
which the observed data could have been reproduced by 
a local model. For instance, consider the CHSH test and 
let (a x b y ) obs be the mean of the observed products of a 
and 6 when measurement x and y are c hosen computed 
over N trials. It is shown in (Gill, 120121 ). that the prob- 
ability that two devices behaving according to a local 
model give rise to a value S ohs = (an&o)obs + ( a <A)obs + 
(aibo)obs - (ai&i)obs > 2 + e is 



Pr(S obs >2 + e) <8e~ 4N ^ 2 



(94) 



This statement assumes that the behavior of the devices 
at the fc-th trial does not depend on the inputs and out- 
puts in previous runs. But this memoryless assumption 
can be avoided and similar statements taking int o ac- 
count arbitrary memory effects can be obtained ( Gilll . 
120031 ) . As discussed in section III.Gl it is easy to con- 
vince oneself that, in the limit of infinitely many runs, 
devices wit h memory cannot fake th e viola tion of a Bell 
inequality (|Barrett et all 120021: [Gill . 120031) . Indeed, for 
any given local variable strategy, there is always at least 
one set of settings for which that strategy fails in the 
corresponding Bell game. But in any run, the settings 
are chosen at random, independently of the source and 
of the past: therefore, even taking the past into account, 
the local variables cannot avoid the possibility that the 
wrong settings are chosen. This reasoning can be extend 
to the finite statistics case through the use of martingales 
(lGiill2003h . A better general method to deal with mem- 
ory effects and finite statistics which is asymptotically 
optimal in the limit of large trials has been proposed in 
(|Zhang et aZ.I . l201lh . 



C. Towards a loophole-free Bell test 

1. Photons 

The main drawback of photonic experiments for per- 
forming a loophole-free Bell test is the limited detec- 
tion of single photon detectors. Nevertheless, consid- 
erable progress has been achieved in the last years, in 
particular with the development of detectors based on 
superconducting materials, which can achieve detection 
efficiencies above 95%. Such detectors were recently 
used in Bell- type experiment. First, in ( Smith et all 
l2012t ) a detection loophole-free demonstration of quan- 
tum steering (see Section IVIII.Cp was reported. More re- 
cently, an experiment demonstrated violation of Clauser- 
Horne Bell inequality with the detection loophole closed 
( Giustina et all l2012h . Here total efficiencies of 75% 



were achieved for each party. Note however, that these 
experiments are table-top and using a static setup with 
measurement settings held fixed and thus did not close 
the locality loophole. Since the detection process in su- 
perconducting detectors is typically slow (of order of fis), 
achieving a loophole- free Bell violation will require a sep- 
aration of the order of 300m. 

Another possibility was recently suggested in 
(|Cabello and Sciarrinol |2012| ). which consists in pre- 
ccrtifying the presence of a photon before the choice of 
local measurement is performed. This proposal appears 
however challenging from an experimental point of view. 



2. Continuous variable systems. 

An interesting alternative for achieving high detection 
efficiencies using photon consists in using homodyne mea- 
surements, which measure a continuous degree of free- 
dom (often called quadrature) of the optical mode. Such 
measurements are realized by mixing the optical mode 
with an intense reference oscillator on a beam splitter, 
and can reach efficiencies close to unity nowadays. First 
proposal for using homodyne measurements in Bell tests 
were pres ented in (iGrangier et all Il988t iTan et all . Il99ll ) 
(see also ( Gilchrist et al. , 19981 )). Since then, many alter- 
native proposals were discussed. However, their experi- 
mental realization has remained elusive so far. 

First of all, it is important to mention that a homo- 
dyne measurement has a continuous number of possible 
outcomes. Since Bell inequalities have typically a discrete 
number of outcomes (for instance, binary outcomes in the 
case of CHSH), one has to dichotomize the outcome of 
the homodyne measurement, a procedure referred to as 
binning. For instance, a natural dichotomization strat- 
egy is given by the sign binning, where one assigns the 
values +1 if the measurement returns a positive outcome, 
and —1 otherwise. 

Homodyne measurements were shown to be able 
to dete ct the nonlocality of some quantum states 
( Munrol [l999). However, all tests proposed so 
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far present major difficulties for experimental real- 
izations. First, several schemes consider quantum 
state s which cannot be produced using current technol- 
ogy (lAcm et all l2009t ICavalcanti et all 120071: iMunrol . 
19991: IWenger et all \20oK ~ Second, the propo s als o f 
( Garcfa-Patron et al . 2004: Nh a and CarmichaelL \2004 



(see also ( Garcia-Patron et all l2005h ) use states which 
could be realized experimentally but lead only to very 
small Bell inequality violations, hence requiring an ex- 
tremely low level of noise, currently out of reach from 
an experimental point of view. Note that an experiment 
using homodyne measur ements demonstrated a violation 
of the CHSH inequality ( Babichev et all [20051 ). but post- 
selection was involved which thus opened the detection 
loophole. 

An interesting alternative consists in devising hy- 
brid schemes, which make use of ho modyne measure- 
ments as well as photo-detection. In ( Cavalcanti et all 
l2011bh it was shown that relatively high CHSH vi- 
olations can be achieved using a state that can 
be experimentally produced 19 . Promising de- 
velopments of hybrid s c hemes were recent ly dis- 
cussed in (lArauio et al . 2012t Bra.sk et al I l2012l 

iBrask and Chavesl 120121 : iQuintino et aZ.I . |2012[ k 

Finally, several works also considered more complex 
measureme nts, such as parity measur e ments . In par- 
ticular, in dBanaszek and Wodkiewicd . I1998T ) (see also 



( Banaszek and Wodkiewicz . 19991 )) it is demonstrated 



that such measurements can revea l the nonlocality of th e 
famous EPR state, discussed in (jEinstein et Ml, I1935D . 
The use of measureme nts based on nonlinear local oper- 
ations was proposed in ( Stobihska et aZ.l . l2007t ). However, 
realization of such measurements is still out of reach from 
current technologies. 



3. Atom-atom and atom-photon entanglement 

A promising avenue towards a loophole-free Bell 
test is based on the the possibility to generate long- 
distance entanglement between two trapped atoms 
( Simon and Irvinel |2003| ) . The procedure for entangling 
the two remote atoms consists in the joint detection of 
two photons, each coming from one of the ions, in an 
entangled basis. In this way, the initial atom-photon en- 
tanglement is transformed into atom-atom entanglement 
via entanglement swapping. 

This scheme was demonstrated experimentally 
using two trap pe d io ns separated by one meter 



dHofmann et al 



iPironio et al 



20121 : 
2010al) . 



iMatsukevich et all 120081: 
The detection loophole was 



closed in these experiments, thanks to the near unit 



19 Note that the idea of conside ring hybrid measurements was first 
discussed in l|ji et all | 2010l). although the proposed scheme is 
not a proper Bell test 1 Cavalcanti and Scaranj , l201ll ). 



efficiency of atomic measurements. In order to close the 
locality loophole, a distance of the order of 300 meters 
would be required using th e fastest atomic measur ement 
techniques available today ( Rosenfeld et all 120091 ). This 
is still currently challenging but the perspectives for a 
loophole-free test are promising. 

Bell tests based on the direct observation of 
atom-photon entang l ement have also been pr oposed 
(|Brunner et all 120071 ICabello and LarssonL l2007h . How- 



ever, the difficulty of efficiently collecting the photons 
emitted from the atom, and the relatively high detec- 
tion efficiencies required for the photon detection in or- 
der to close the detection loophole make this approach 
more delicate to implement. To overcome some of these 
problems, the use of continuous variable degrees of free- 
dom of the light field, combined with efficient homodyne 
measurements was recently explored ( Araujo et all l2012t 
ISangouard eTal\ , \2Qlli iTeo et all l2012t) . 



D. Bell tests without alignment 

Bell inequality violations in quantum mechanics re- 
quires the parties to perform judiciously chosen measure- 
ment settings on an entangled state. Experimentally, this 
requires a careful calibration of the measurement devices 
and alignment of a shared reference frame between the 
distant parties. Although such assumptions are typically 
made implicitly in theoretical works, establishing a com- 
mon reference frame, as well as aligning and calibrating 
measurement devices in experimental situations are never 
trivial issues. For instance, in quantum communications 
via optical fibres, unavoidable small temperature changes 
induce strong rotations of the polarization of photons in 
the fibre, which makes it challenging to maintain a good 
alignment. In turn this may considerably degrade the 
implementation of quantum protocols, such as Bell tests. 

This lead several authors to investigate whether Bell 
tests, and more generally quantum comm unication pro- 
tocols ( S.D. Bartlett and Spekkensl . l2007l ). could be real- 
ized without the need of a common reference frame. The 



first appro ach proposed was based on decoherence-free 
subspaces (|CabelloL 120031 ) . The experimental realization 



of such ideas is challenging as it requires high-dimensional 
entanglement, although progress was recently reported 
(|D'Ambrosio et aZ.I . l2012h . 

A more recent approach investigated Bell tests 
performed with randomly chosen measurements 
([Liang et al ], l2010t ). This first theoretical work consid- 
ered the CHSH Bell scenario, with qubit measurements 
chosen randomly and uniformly (according to the Haar 
measure) on the Bloch sphere, on a maximally entangled 
state of two-qubits. Somehow surprisingly, even when all 
four measurements are chosen at random, the probability 
of obtaining nonlocal correlations is ~ 28%. When un- 
biased measurements are used, this probability increases 
to ~ 42%. More recently it was shown however that if 
both parties perform three unbiased measurements (i.e. 
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forming an orthogonal triad on the Bloch sphere), the 
probability of v i olating a Bell inequality becomes o ne 
(Shadbolt et al ], 120121 : IWallman and Bartlettl l2012r iV 
This scheme was realized experimentally demonstrating 
the robustness of the effect to experi mental imperfec- 
tions such as loss es and finite s t atistic s ( Shadbolt et all 
l2012h (see also ( Palsson et all 120121 )). From a more 
conceptual point of view, these works are interesting 
as they show that quantum nonlocality is much more 
generic than previously thought. 



VIII. RELATED CONCEPTS 

A. Bi-locality and more general correlation scenarios 

In a tripartite Bell scenario, the standard definition of 
locality is given by 

p(abc\xyz) = / d\p(\)p\(a\x)p\(b\y)p\(c\z) (95) 



where A is a shared local random variable and J dXp(X) = 
1. This represents the most general model in which the 
outcome of each each observer is determined by his input 
and A. Since A is shared between all three parties, arbi- 
trary prior correlations can be established between the 
parties. 

In certain quantum experiments involving three sepa- 
rated observers, the distribution p(abc\xyz) is obtained 
by performing measurements on independent quantum 
states, originating from different sources. A typi- 
cal example is the pr otocol of entanglement swapping 
(jZukowski et all Il993| ) — also known as teleportation of 
entanglement — in which two systems that never inter- 
acted become entangled. Here, one party (say Bob), 
shares initially an entangled state with both Alice (de- 
noted system AB\) and Charlie (system B2C). That 
is, Alice and Bob share an entangled pair distributed by 
a source located between them, while a second source 
distributes entanglement between Bob and Charlie. Im- 
portantly, these two sources are completely independent 
from each other, hence systems AB\ and B^C share no 
prior correlations. It is then natural to ask whether the 
observed correlations p(abc\xyz) can be reproduced using 
a local model with the same feature, that is, in which sys- 
tems AB\ and B^G are initially uncorrelatcd. Formally 
such models can be written in the following form 

p(abc\xyz) = I dXdpp(X)p(p)px{a\x)px, fJ ,(b\y)p l _ l {c\z) 



(96) 

where Alice and Bob share the local random variable A, 
while Bob and Charlie share p. Since the variables A and 
p are assumed to be independent, their distribution fac- 
torizes, i.e. p(A, p) = p(X)p(p). The above condition is 
termed bilocality, and correlations satisfying it are called 
bilocal. It turns out that not all local correlations (i.e. of 



the form ([95])) can be written in the bilocal form. Hence 
the bilocality condition is a strictly stronger constraint 
than locality, it is then interesting to ask how to char- 
acterize the set of bilocal correlations, as this will lead 
to more stringent tests for revealing nonlocality in an 
entanglement swapping experiment. 

First exploratory works investigated this question in 
the context of the detectio n loophole ( Gisin and Gisinl . 
l2002tlZukowskT et al. I. I2008D. More recent l y, a systemati c 
approach was taken in (jBranciard et all |2010L |2012al) . 
In particular, these works present nonlinear inequali- 
ties for testing the bilocality condition, which are much 
more stringent compared to standard Bell inequalities. 
Note that the set of bilocal correlations is not convex 
in general, hence its characterization requires nonlinear 
inequalities. 

More generally, it is possible to consider an arbitrary 
correlation scenario, involving an arbitrary number of 
sources and observers, where certain systems can be ini- 
tially correlated while others are independent. Similarly 
to the above discussion, when two systems are assumed 
to be in dependent, they are described by a pro duct distri- 
bution (jBranciard et qZj . |2012al iFritzJ . l2012bl ). In fact, a 
typical Bell experiment can be viewed in this picture, fea- 
turing three independent sources. These are the source 
that produces the entangled state, the source generat- 
ing the measurement settings of Alice, and the source 
generating the setting of Bob. Indeed, if these sources 
arc not independent, the Bell violation is plagued by 
the measurement-independence loophole. A related ap- 
proach for considering locality in general correlation sce- 
narios usin g the formalism of causal networks was put 
forward in (jWood and Spekkensl . 120121 ). 



B. No-go theorems for nonlocal models 

Understandably, the study of nonlocal models is re- 
duced to few examples. On the one hand, it is obvious 
that some of these models will reproduce all observed 
correlations, so there is no hope of finding a result a la 
Bell which would falsify all of them. On the other hand, 
one needs to have a good motivation in order to propose 
a specific example of nonlocal model. In fact, basically 
two classes of models have been considered: wc review 
them briefly here, but refer to the original articles for a 
detailed justification of the interest of each model. 



1. Models a la Leggett 

Pure entangled states are characterized by the fact that 
the properties of the pair are well defined, but those of 
the individual subsystems are not. Consider for instance 
a maximally entangled state of two qubits. Although the 
global state has zero entropy, the state being pure, the re- 
duced state of each qubit is fully mixed thus having max- 
imum entropy. An interesting question is whether one 
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could devise alternative no-signaling models, reproduc- 
ing quantum correlations, in which the individual prop- 
erties are well-defined, or at least where the model gives a 
higher degree of predictability compared to quantum pre- 
dictions. The fi rst work in this direction was presented by 
(|Leggettl . l200l . who discussed a specific nonlocal model 
and proved its inability to reproduce quantum corrcla- 
tion s. Leggett's model was first tested experimentally 
by (|Groblacher et all 120071 ). A clear discussion of the 
scope and limitations o f this type of models is found in 
([Branciard et all I200&D . 

In a nutshell, the question is whether the probability 
distribution predicted by quantum theory pq can be seen 
as a convex combination of more fundamental distribu- 
tions: pq = J d\p\. Because of Bell's theorem, for some 
A at least, the distribution p\ must be nonlocal; but let 
us leave the correlations and their mechanism aside and 
concentrate on the marginals: we request that the p\ 
specify well defined individual properties. Focusing on 
the case of a maximally entangled qubit pair, Lcggett 
proposed a model in which the marginals take the form 
px(a\a) = i(l+au-a) andpA^I^) = h(l+bv-b). Here the 
the hidden variables consists of two vectors: A = (u,v). 
The intuition behind this model is the following. Lo- 
cally, say on Alice's side, the system behaves as if it had 
well defined polarization given by u. For a measurement 
direction a, the marginal distribution is then given by 
Malus' law. Hence this model makes more definite pre- 
dictions for individual properties compared to quantum 
theory. It turns out however that Leggett's model is un- 
able to reproduce quantum correlations. In particular, 
from the no-signaling condition and the above marginals, 
one can derive constraints, in the form of inequalities, on 
the correlations. Quantum predictions violate these in- 
equalities. Note that there is no direct relation between 
Leggett inequalities and Bell inequalities. In particular, 
the tests of Leggett inequalities known to date rely on 
the characterization of the measurement parameters and 
are therefore not device-independent, contrary to Bell in- 
equalities. 

Finally, not that more general models were also dis- 
cussed and demonstrated to be incompatible with quan- 
tum predictions. This shows that quantum correlations 
cannot be reproduced using no-signaling theories which 
make more accurate predictions of individual proper- 
ties compa red to quantum theory ( Colbeck and Rennerl . 
I2008L 12012ft 



2. Superluminal signaling models 

A second class of models addresses the possibility of 
explaining quantum correlation using a signaling mech- 
anism. Of course, this is problematic, because the signal 
should propagate faster than light: these models must 
thus postulate the existence of a preferred frame in which 
this signal propagates. Two cases have been considered. 

In the first one, the preferred frame is universal. From 



Bell's theorem, it follows that the speed v of the signal 
must be infinite. Clearly, one can find a model with v 
being infinite which reproduces all quantum correlations. 
On the other hand, the predictions of any signaling model 
where v is finite will differ from those of quantum me- 
chanics. For instance, in a bipartite Bell test, nonlocal 
correlations should vanish when the distance between the 
two observers exceeds a certain bound, since then there is 
simply not enough time for the signal to propagate. Ex- 
perimental investig ations could place lower bounds on v 
(ISalart et all. 120081 For a wide class of preferred frames, 
this bound exceeds the speed of light by several orders 
of magnitude. Furthermore, it is in fact possible to show 
theoretically that any signaling model in which v is fi- 
nite is incompatible with quantum predictions, by consid - 
ering a multipartite arrangement (jBancal et all l2012bl) . 
Specifically, for certain multipartite nonlocal quantum 
correlations to be recovered, the existence of a superlumi- 
nal signal cannot remain hidden, i.e. the model must nec- 
essarily break relativistic causality at the level of the ob- 
served statistics. Therefore, under the assumption that 
the observed statistics satisfy the no-signaling principle, 
quantum correlations cannot be reproduced by a model 
with finite speed signaling. 

In the second type of models, the rest frame of each 
measurement device is its own preferred frame. In this 
case, if the measurement devices moves away from one 
another, a before-before configuration can be created, in 
which each particle perceives that it is the first one to 
undergo the meas urement: then, nonlocal correlations 
should disappear ( Suarez and Scaranil Il997|). This pre- 
diction has been falsified experim entally ( Stefanov et all 
120021 l2003HZbinden et aZl . l200l[ ). 



C. Steering 

One of the most common ways of describing the effect 
of entanglement , noti ced already in the seminal paper 
of (ISchrodingerl . Il936h . uses the notion of steering: by 
making a measurement on her system, Alice can prepare 
at a distance Bob's state. More precisely, Alice cannot 
choose which state she prepares in each instance, because 
this would amount to signaling; however, if she sends to 
Bob the results of her measurements, Bob can check that 
indeed his conditional states had been steered. 

Though often invoked in the field, this notion had 
not been the objec t of de tailed studies until the work 
of ( Wiseman et all 12007ft . In this and subsequent 
work, these authors formalized steering as information- 
theoretic tasks, and pointed out how it differs from non- 
locality. Steering can be viewed as the distribution of 
entanglement from an untrusted party. Alice wants to 
convince Bob that she can prepare entanglement. Bob 
trusts his measurement device, hence he knows what ob- 
scrvables he is measuring on his system. However Bob 
does not trust Alice, whose device is then described by 
a black box. In this sense the task is intermediate be- 
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tween nonlocality (in which both Alice and Bob work 
with black boxes) and standard entanglement witnessing 
(in which both parties have perfect control of the observ- 
ables which are measured). The protocol works as fol- 
lows. Alice sends a quantum system to Bob, whose state, 
she claims, is entangled to her system. Upon receiving 
his system, Bob chooses a measurement setting (from a 
pre-determined set of measurements) to perform on it. 
He then informs Alice about his choice of measurement, 
and asks her to provide a guess for his measurement out- 
come. After repeating this procedure a sufficiently large 
number of times, Bob can estimate how strongly his sys- 
tem is correlated to that of Alice. If the correlations are 
strong enough, Bob concludes that the systems are in- 
deed entangled and that Alice did indeed steer his state. 

Interestingly, it turns out that entanglement is nec- 
essary but not sufficient for steering, while steering is 
necessary but not sufficient for nonlocality. Hence, steer- 
ing represents a novel form of inseparability in quantum 
mechanic s, intermediate b etwee n entanglement a n d non - 
locality (|Saunders et all \20lA IWiseman etail 120071 ). 
The quantitative relation between steering, entangle- 
ment, and Bell nonlocality is yet to be understood. 

Experimentally, steering can be tested using steering 
inequalities, similar to Bell inequalities. The first steer- 
ing criterion were derived for continuous variabl e sys- 



tems , mostly based on variances of ob servables (IReid 



1989) and e ntropic unce r tainty relations (jWalborn et al 



20111 ); see (IReid et all l2009h for a review. More re 



ccntly, st eering inequalities were presented for discrete 
variables ( Cavalcanti et all I2009I ). All these tests were 
investigated experimentally. Similarly to Bell tests, ex- 
perimental steering tests suffer from loopholes. Neverthe- 
less, closing these loopholes appears to be less demanding 
compared to Bell tests, in particular in terms of detec- 
tion efficiency. A loophole-free steering e xperiment was 
recently reported ( Wittmann et aUl2012l ). 



be reproduced by a local model. Hence, semi-quantum 
games highlight a nonlocal aspect of quantum states, 
which is however different from Bell nonlocality. More 
recently it was shown that semi-quantum games and en- 
tanglement witnesses are intimately related. In particu- 
lar, for detecting an entangled state p, a semi-quantum 
game can be constru cted directly from an en tanglement 
witness detecting p (jBranciard et al. I. l2012bl) . Applica- 
tions of these ideas for the detect ion of entanglement and 
steering were recently di scussed ( Branciard et a/.l . l2012bt 
ICavalcanti et al. I. l2012cf> . 
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Appendix: Appendix: guide to Bell inequalities 

The goal of this section is to orientate the reader look- 
ing for a particular type of Bell inequality. Here we clas- 
sify Bell inequalities according to the number of parties 
n, the number of measurements for each party m, and the 
number of outcomes d 20 . Below, for any Bell test sce- 
nario given by the triple (n,m,d), we provide references 
to articles presenting relevant Bell inequalities. Note that 
we do not give the inequalities explicitly; some of these 
can nevertheless be found in parts of this review, in par- 
ticular in Sections II and V. 



D. Semi-quantum games 

In the usual Bell test scenario, distant parties share a 
quantum state distributed from a source. The local mea- 
surements and their outcomes are represented by classical 
data. In a recent paper, Buscemi proposed a variant of 
Bell tests, termed semi-quantum games, in whi ch the in- 
puts of each party are given by quantum states ( Buscemi 
I2012D . That is, each party holds a black box in which 
the observer inputs a quantum state. Inside the box, 
this quantum input is then jointly measured with the 
quantum system coming from the source, and a classical 
output is produced. In case the input quantum states are 
orthogonal (hence perfectly distinguishable) , the setup is 
simply equivalent to a standard Bell test with classical in- 
puts. However, when the states are not mutually orthog- 
onal, the setup becomes more general. Buscemi showed 
that, in this case, for any entangled state p there ex- 
ists a semi-quantum game, the statistics of which cannot 



1. Bipartite Bell inequalities 

a. Binary outputs: (2, m, 2) 

For the case m = 2, CHSH is the only tight Bell in- 
equality. Note that if one of the parties has more mea- 
surement, i.e. VfiA = 2 and nig = m, CHSH is still known 
to be the only tight inequality. 

For m = 3, one additional tight inequality arises: 
-^3322 (see Section 2) , dis covered independently by 
Froissar d (iFroissardl Il98lh and Collins and Gisin 
(ICollins and Gisinl . l2004ri ~ 



20 Note that for Bell inequalities featuring different number of mea- 
surements or outcomes for different parties, m and d represent 
the maximum values. 
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For m = 4, 5, the complete list of tight Bell inequali- 
ties is unf ortunately no t known. Incomplete lists c an be 
found in (IBancal et all l20ld : iBrunner and Gisinl . 120081: 
iPal and Vertesil . l2009h . Note that for m A = 4 and 
iub = 3, the complete list of tigh t Bell inequalities was 
given in (jCollins and Gisinl . l2004h . 

For m > 6, much less is known. Incomplete lists of 
facets could be derive d using sophistic ated techniques 
from convex geometry ( Avis et all . 120041 ). 

For arbitrary valu es of m, the family of i nequalities 
J mm 22 introduced in ( Collins and Gisirl 2004) turns out 
to be tight in general ( Avis and ltd . 20071 ). It is also 
worth mentioning the family of chained Bell inequalities 
( Braunstein and CavesL Il990l ). although these inequali- 
ties are not tight for m > 3. 



b. Arbitrary number of outputs: (2,m,d) 

In the case m = 2 a nd d = 3, the inequality of CGLMP 
( Collins et al ], l2002al ) is the only tight inequality addi- 
tional to CHSH. For d > 4, the CGLMP inequality is 
known to be tight, bu t there exist additional facets in 
this case ( Bancal et al 1 120101) . 

Note that the chained Bell inequaliti e s hay e been gen- 
eralized to this scenario ( Barrett et al ., 2007). Whether 
they are tight or not for d > 3 is not known. 



Note also that there exist functional Bell inequalities 
which can be defined for an infinite number of settings 
(ISenfPel et all l2002ft . 



2. Multipartite Bell inequalities 

a. Binary outputs: (n, m, 2) 

All tight correlation Bell inequalities are known 
for the case m = 2 (IWerner and WolA 120021: 
IZukowski and BrukneA l2002j ). Indeed, this set contains 
the inequ alities of Mermin-Ardehali-Belinski-K l yshko 
(MABK ) (lArdehalil . Il992t iBelinskii and Klvshkol Il993t 
lMerminl . [l990ah . 

For non-correlation Bell inequality, the complete set 
of tight Bell inequa l ities in the case n = 3 and m = 2 
was given in (ISliwal. 120031 ). For arbitrary n and m, Ref. 
(jLaskowski et all 120041 ) provided tight Bell inequalities. 



b. Arbitrary number of outputs: (n,m,d) 

A general family of Bell i nequalities based on var iance 
inequalities was derive d in (ICavalcanti et "all 120071) . and 
further developed in (|He et all l2009h . More generally 
the se inequalities are particular cases of inequalities and 
in ( Salles et al ., 2010). Note that these inequalities are 
not tight as they are not linear. However, they can be 
conveniently used for continuous variables (i.e. d = oo). 

A generalization of the Mermi n inequalities for t he sce - 
nario (3,2,d) was presented in ( Grandiean et all 120121) : 
these inequalities are known to be tight for d < 8. For 
n = 3, m = 2 and d = 3, 4, 5, tight inequalities were given 
in (|Chen et all 120081) . 



c. Bell inequalities detecting genuine multipartite nonlocality 

Svetlichny derived the first i nequality for test ing gen- 
uine multipartite nonlocality ( Svetlichnvl . Il987l) in the 
case (3,2,2). This inequality was later generalized to 
an arbit rary number of part i es, e.g. to the scenario 
(n,2 2) dCollins et all l2002bt ISeevinck and Svetlichnvl . 
120021 ). Ref. (|Bancal et all 12010^ also introduced more 
inequalities for the simplest case (3, 2, 2). 

Svetlichny's inequality was also generaliz e d to a more 
general scenario: (n,2,d) in (iBancal et all l2011al) and 



(n, m, d) in (IBancal et al 



In Ref. (Ba rrett et al. 



2012al) . 

I l201ll) . the authors discussed 
various definitions of the concept of genuine multipartite 
nonlocality, and introduced inequalities for testing each 
of these models in the scenario (3, 2, 2). 
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